METHOD FOR PURIFYING KERATINOCYTE GROWTH FACTORS
Field of the Invention

The present invention relates to the field of
protein purification. Specifically, the present
invention relates to the field of purifying keratinocyte
growth factors.

Background of the Invention

Polypeptide growth factors are important
mediators of intercellular communication (Rubin et al.
(1989), Proc. Natl. Acad. Sci. USA, 86:802-806). These
molecules are generally released by one cell type and
act to influence proliferation of other cell types.

One family of growth factors is the fibroblast
growth factors (FGF). There are currently eight known
FGF family members which share a relatedness among
primary structures: basic fibroblast growth factor,
bFGF (Abraham et al. (1986), EMBO J., 5:2523-2528);
acidic fibroblast growth factor, aFGF (Jaye et al.
(1986), Science, 233:541-545); int-2 gene product, int-2
(Dickson & Peters (1987), Nature, 326:833); hst/kFGF
(Delli-Bovi et al. (1987), Cell, 50:729-737, and
Yoshida et al. (1987), Proc. Natl. Acad. Sci. USA,
84:7305-7309); FGF-5 (Zhan et al. (1988), Mol. Cell.
Biol., 8:3487-3495); FGF-6 (Marics et al. (1989),
Oncogene, 4:335-340); keratinocyte growth factor (Finch
et al. (1989), Science, 245:752-755; Rubin et al. (1989),
Proc. Natl. Acad. Sci. USA, 86:802-806; Ron et al.
(1993), The Journal of Biological Chemistry,
268(4):2984-2988; and Yan et al. (1991), In Vitro Cell.
Dev. Biol., 27A:437-438); and hisactophilin (Habazettl
et al. (1992), Nature, 359:855-858).

Among the FGF family of proteins, keratinocyte
growth factor (KGF) is a unique effector of
non-fibroblast epithelial (particularly keratinocyte) cell
proliferation derived from mesenchymal tissues. The
term "native KGF" refers to a natural human (hKGF) or
recombinant (rKGF) polypeptide (with or without a signal
sequence) as depicted by the amino acid sequence
presented in SEQ ID NO:2 or an allelic variant thereof.
[Unless otherwise indicated, amino acid numbering for
molecules described herein shall correspond to that
presented for the mature form of the native molecule
(i.e., minus the signal sequence), as depicted by amino
acids 32 to 194 of SEQ ID NO:2.]
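
The numbering convention stated above amounts to a fixed offset between the mature protein and SEQ ID NO:2. The following is a minimal sketch of that conversion, assuming only what the text states (mature form = residues 32 to 194 of SEQ ID NO:2); the function and constant names are illustrative and do not appear in the specification.

```python
# Residues 1-31 of SEQ ID NO:2 form the signal sequence, so the mature
# protein starts at residue 32 and ends at residue 194 (163 residues).
SIGNAL_SEQUENCE_LENGTH = 31
MATURE_LENGTH = 194 - 32 + 1


def mature_to_seq_id_2(position: int) -> int:
    """Convert a mature-protein position to its position in SEQ ID NO:2."""
    if not 1 <= position <= MATURE_LENGTH:
        raise ValueError(f"mature position must be 1..{MATURE_LENGTH}")
    return position + SIGNAL_SEQUENCE_LENGTH


def seq_id_2_to_mature(position: int) -> int:
    """Convert a SEQ ID NO:2 position (32..194) to mature numbering."""
    if not 32 <= position <= 194:
        raise ValueError("position lies outside the mature region 32..194")
    return position - SIGNAL_SEQUENCE_LENGTH


if __name__ == "__main__":
    for mature in (1, 15, 144, 163):
        print(mature, "->", mature_to_seq_id_2(mature))
```

Under this convention an analog name such as R(144)Q refers to mature residue 144, which is residue 175 of SEQ ID NO:2.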

Native KGF may be isolated from natural
sources. For example, hKGF can be isolated from medium
conditioned by an embryonic lung fibroblast cell line
(Rubin et al. (1989), supra). Three chromatographic
steps, namely heparin-Sepharose™ (Pharmacia, Piscataway,
NJ) affinity chromatography, HPLC gel filtration, and
reverse-phase HPLC, were used to obtain a purified hKGF
preparation. Approximately 6 mg of hKGF were recovered
from 10 liters of conditioned medium. These
chromatographic steps recovered only 0.8% of the total
hKGF, based upon a mitogenic activity assay. A further
example teaches the use of another chromatographic step
using heparin-Sepharose™ affinity and Mono-S™
ion-exchange chromatography (Pharmacia, Piscataway, NJ) for
isolation of rKGF produced in bacteria (Ron et al.
(1993), Journal of Biological Chemistry, 268(4):2984-2988).

The properties of keratinocyte growth factors
suggest a potential for the application thereof as a
drug for promoting specific stimulation of epithelial
cell growth. It therefore would be desirable to develop
a method or methods for obtaining relatively high levels
of homogeneous keratinocyte growth factors to provide
sufficient quantities of material for comprehensive in
vitro and in vivo biological evaluation and for a
potential therapeutic application.

It is the object of this invention to provide
a novel method for the purification of keratinocyte
growth factors.



Summary of the Invention 

The present invention is directed to a first
method for purifying a keratinocyte growth factor (KGF),
the method comprising:

a) obtaining a solution containing KGF;

b) binding KGF from the solution of part (a)
to a cation exchange resin;

c) eluting KGF in an eluate solution from
the cation exchange resin;

d) passing the eluate solution from part (c)
through a molecular weight exclusion
matrix; and

e) recovering KGF from the molecular weight
exclusion matrix.

The invention is further directed to a second
method for purifying a keratinocyte growth factor (KGF),
the method comprising:

a) obtaining a solution containing KGF;

b) binding KGF from the solution of part (a)
to a cation exchange resin;

c) eluting KGF in an eluate solution from
the cation exchange resin;

d) performing hydrophobic interaction
chromatography on the eluate solution of
part (c); and

e) recovering KGF from the hydrophobic
interaction chromatography step of part
(d).

Generally, the cation exchange chromatography
step of the first or second methods may be conducted
with any suitable buffer (e.g., phosphate buffered saline,
sodium acetate or Tris-HCl) at a pH of preferably
between about 6.8-7.5. Suitable columns for use in this
step include carboxymethyl cellulose, carboxymethyl
agarose and sulfated agarose and cellulose columns
(e.g., columns of S-Sepharose Fast Flow™ resin, Mono-S™
resin and CM-cellulose™ resin, commercially available
from Pharmacia, Piscataway, NJ). The flow rate will be
variable depending upon the column size.

The gel filtration step of the first method may be
conducted in any suitable buffer (e.g., phosphate buffered
saline) at a pH of preferably between about 7.0 and 7.5.
Suitable columns for use in this step include
agarose-based, acrylamide-based, silica-based or polymer-based
size-exclusion columns (e.g., columns of Sephadex G-75™
resin and Superdex-75™ resin, commercially available
from Pharmacia).

In a particularly preferred embodiment of the
second method, free sulfhydryl groups may be oxidized
prior to the hydrophobic interaction step, discussed
below. Any manner of oxidation may be employed. For
example, the protein may be exposed to atmospheric
oxygen for a suitable period of time. Alternatively,
various oxidation procedures may be employed. One such
procedure is particularly suited for keratinocyte growth
factors wherein one or more cysteine residues, as
compared to the native KGF molecule, are deleted or
replaced. In this procedure an oxidizing agent (e.g.,
cystamine dihydrochloride or another appropriate
oxidizing agent, for instance, cystine, oxidized
glutathione or divalent copper) may be added to a final
concentration, adjusting the pH to preferably between
about 7-9.5 (with pH 9.0 ± 0.3 being more preferred
when using cystamine dihydrochloride), and holding the



WO 96/11952 






PCT/US93/13099 



temperature at preferably between about 10-30°C, for an
appropriate period. The second procedure may be used
for oxidizing native KGF and other keratinocyte growth
factors with comparable patterns of cysteine residues.
In this procedure, oxidation may be accomplished by
adding an appropriate amount of an ionic strength
modifier (e.g., (NH4)2SO4), adjusting the pH to
preferably between about 7.5-9.5, and holding the
temperature at preferably about 23 ± 5°C for an
appropriate period.

The hydrophobic interaction step of the second
method may be conducted by using any suitable buffer
(e.g., sodium phosphate) at a pH of preferably between
about 6.0-8.0, more preferably about 7.0, and by eluting
with a decreasing linear (NH4)2SO4 gradient ranging from
2 M to 0 M. Suitable columns for use in this step include
alkyl or phenyl substituted resins (e.g., a column of
Butyl-650M Toyopearl™ resin, commercially available from
Tosohaas, Inc., Montgomeryville, PA and columns of
phenyl Sepharose™ resin and phenyl Superose™ resin,
commercially available from Pharmacia).

The process of the present invention may be
used to purify KGF. Thus, it should be understood that
the terms "keratinocyte growth factor" and "KGF" as
employed in this description are intended to include,
and to mean interchangeably unless otherwise indicated,
native KGF and KGF analog proteins (or "muteins")
characterized by a peptide sequence substantially the
same as the peptide sequence of native KGF and by
retaining some or all of the biological activity of
native KGF, particularly non-fibroblast epithelial cell
proliferation (e.g., exhibiting at least about 500-fold
greater stimulation of BALB/MK keratinocyte cells than
that of NIH/3T3 fibroblast cells, and at least about
50-fold greater stimulation of BALB/MK keratinocyte cells
than for B5/589 epithelial cells or for CCL208
epithelial cells, as determined by 3H-thymidine
incorporation). By "characterized by a peptide sequence
substantially the same as the peptide sequence of native
KGF" is meant a peptide sequence which is encoded by a
DNA sequence capable of hybridizing to nucleotides 201
to 684 of SEQ ID NO:1, preferably under stringent
hybridization conditions.

The corresponding amino
acid position between two amino acid sequences may be
determined by aligning the two sequences to maximize
matches of residues, including shifting the amino and/or
carboxyl terminus, introducing gaps as required and/or
deleting residues present as inserts in the candidate.
Database searches, sequence analysis and manipulations
may be performed using one of the well-known and
routinely used sequence homology/identity scanning
algorithm programs (e.g., Pearson and Lipman (1988),
Proc. Natl. Acad. Sci. U.S.A., 85:2444-2448; Altschul et
al. (1990), J. Mol. Biol., 215:403-410; Lipman and
Pearson (1985), Science, 227:1435 or Devereux et al.
(1984), Nuc. Acids Res., 12:387-395).

Stringent conditions, in the hybridization
context, will be stringent combined conditions of salt,
temperature, organic solvents and other parameters
typically controlled in hybridization reactions.
Exemplary stringent hybridization conditions are
hybridization in 4 X SSC at 62-67°C, followed by
washing in 0.1 X SSC at 62-67°C for approximately an
hour. Alternatively, exemplary stringent hybridization
conditions are hybridization in 45-55% formamide, 4 X
SSC at 40-45°C. [See, T. Maniatis et al., Molecular
Cloning (A Laboratory Manual); Cold Spring Harbor
Laboratory (1982), pages 387 to 389].

Thus, the proteins include allelic variations,
or deletion(s), substitution(s) or insertion(s) of amino
acids, including fragments, chimeric or hybrid molecules
of native KGF. One example of KGF includes proteins
having residues corresponding to Cys1 and Cys15 of SEQ ID
NO:2 replaced or deleted, with the resultant molecule
having improved stability as compared with the parent
molecule (as taught in commonly owned U.S. S.N.
08/487,825, filed on July 7, 1995). Another example of
KGF includes charge-change polypeptides wherein one or
more of amino acid residues 41-154 of native KGF
(preferably residues Arg41, Gln43, Lys55, Lys95, Lys128,
Asn137, Gln138, Lys139, Arg144, Lys147, Gln152, Lys153 or
Thr154) are deleted or substituted with a neutral residue
or negatively charged residue selected to effect a
protein with a reduced positive charge (as taught in
commonly owned U.S. S.N. 08/323,337, filed on October 13,
1994). A still further example of KGF includes proteins
generated by substituting at least one amino acid having
a higher loop-forming potential for at least one amino
acid within a loop-forming region of
Asn115-His116-Tyr117-Asn118-Thr119 of native KGF (as taught in
commonly owned U.S. S.N. 08/323,473, filed on October 13,
1994). A still yet further example includes proteins
having one or more amino acid substitutions, deletions
or additions within a region of 123-133 (amino acids
154-164 of SEQ ID NO:2) of native KGF; these proteins
may have agonistic or antagonistic activity.

Specifically disclosed proteins include the
following KGF molecules (referred to by the residue
found at that position in the mature protein (minus
signal sequence) set forth in SEQ ID NO:2, followed by
that amino acid position in parentheses and then either
the substituted residue or "-" to designate a deletion):
C(1,15)S, ΔN15-ΔN24, ΔN3/C(15)S, ΔN3/C(15)-, ΔN8/C(15)S,
ΔN8/C(15)-, C(1,15)S/R(144)E, C(1,15)S/R(144)Q,
ΔN23/R(144)Q, C(1,15,40)S, C(1,15,102)S,
C(1,15,102,106)S, ΔN23/N(137)E, ΔN23/K(139)E,
ΔN23/K(139)Q, ΔN23/R(144)A, ΔN23/R(144)E, ΔN23/R(144)L,
ΔN23/K(147)E, ΔN23/K(147)Q, ΔN23/K(153)E, ΔN23/K(153)Q,
ΔN23/Q(152)E/K(153)E, R(144)Q and H(116)G.
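
The naming convention for the analogs listed above is regular enough to be parsed mechanically. The sketch below is one possible reading of it, offered as an illustration only: the `Change` class and `parse_analog` function are hypothetical names, and the N-terminal deletion prefix is assumed to be written "ΔN" followed by the number of residues removed.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Change:
    kind: str          # "substitution", "deletion" or "n_terminal_deletion"
    original: str      # one-letter code of the native residue ("" for ΔN)
    positions: tuple   # mature-protein positions affected
    replacement: str   # one-letter code of the new residue, or "-"


# e.g. "C(1,15)S" or "C(15)-": residue, position list, replacement or "-"
_POINT = re.compile(r"^([A-Z])\(([\d,\s]+)\)([A-Z]|-)$")
# e.g. "ΔN23": deletion of the first 23 residues of the mature protein
_N_DELETION = re.compile(r"^ΔN(\d+)$")


def parse_analog(name: str) -> list:
    """Split an analog name on "/" and decode each component."""
    changes = []
    for part in (p.strip() for p in name.split("/")):
        match = _N_DELETION.match(part)
        if match:
            count = int(match.group(1))
            changes.append(
                Change("n_terminal_deletion", "", tuple(range(1, count + 1)), "-")
            )
            continue
        match = _POINT.match(part)
        if match is None:
            raise ValueError(f"unrecognized component: {part!r}")
        original, position_text, replacement = match.groups()
        positions = tuple(int(p) for p in position_text.split(","))
        kind = "deletion" if replacement == "-" else "substitution"
        changes.append(Change(kind, original, positions, replacement))
    return changes


if __name__ == "__main__":
    for analog in ("C(1,15)S/R(144)E", "ΔN23/R(144)Q", "ΔN3/C(15)-"):
        print(analog, parse_analog(analog))
```

A range such as "ΔN15-ΔN24" denotes a family of ten analogs and would need to be expanded into individual names before being passed to this parser.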

As those skilled in the art will also
appreciate, a variety of host-vector systems may be
utilized to express the KGF protein-coding sequence.
These include but are not limited to eucaryotic cell
systems such as mammalian cell systems infected with
virus (e.g., vaccinia virus, adenovirus, etc.); insect
cell systems infected with virus (e.g., baculovirus);
microorganisms such as yeast containing yeast vectors;
or to procaryotic cell systems such as bacteria
transformed with bacteriophage DNA, plasmid DNA, or
cosmid DNA. The expression elements of these vectors
vary in their strengths and specificities. Depending on
the host-vector system utilized, any one of a number of
suitable transcription and translation elements may be
used.

Once the protein product of KGF expression has
been isolated, purified and assayed for KGF activity
(using procedures known to those skilled in the art), it
may be formulated in a variety of pharmaceutical
compositions. Typically, such compositions include a
suitable, usually chemically-defined, carrier or
excipient for the therapeutic agent and, depending on
the intended form of administration, other ingredients
as well. The composition can include aqueous carriers
or consist of solid phase formulations in which KGF is
incorporated into non-aqueous carriers such as
collagens, hyaluronic acid, and various polymers. The
composition can be suitably formulated to be
administered in a variety of ways, including by
injection, orally, topically, intranasally and by
pulmonary delivery.

Brief Description of the Drawings

Figure 1 shows the nucleotide (SEQ ID NO:1)
and amino acid (SEQ ID NO:2) sequences of native KGF
(the nucleotides encoding the mature form of native KGF
are depicted by bases 201 to 684 of SEQ ID NO:1 and the
mature form of KGF is depicted by amino acid residues 32
to 194 of SEQ ID NO:2).

Figures 2A, 2B and 2C show the plasmid maps of
pCFM1156, pCFM1656 and pCFM3102, respectively.

Figure 3 shows the nucleotide (SEQ ID NO:3)
and amino acid (SEQ ID NO:4) sequences of the construct
RSH-KGF.

Figure 4 shows the nucleotide (SEQ ID NO:5)
and amino acid (SEQ ID NO:6) sequences of the construct
contained in plasmid KGF.

Figure 5 shows the chemically synthesized
OLIGOs (OLIGO#6 through OLIGO#11; SEQ ID NO:12-17,
respectively) used to substitute the DNA sequence
between a KpnI site and an EcoRI site for a KpnI site
(from amino acid positions 46 to 85 of SEQ ID NO:6) in
the construct contained in plasmid KGF to produce the
construct in plasmid KGF(dsd).

Figure 6 shows the chemically synthesized
OLIGOs (OLIGO#12 through OLIGO#24; SEQ ID NO:18-30,
respectively) used to construct KGF(codon optimized).

Figure 7 shows the nucleotide (SEQ ID NO:31)
and amino acid (SEQ ID NO:32) sequences of C(1,15)S, a
KGF analog having substitutions of serine for cysteine
at amino acid positions 1 and 15 of native KGF.

Figure 8 shows the nucleotide (SEQ ID NO:33)
and amino acid (SEQ ID NO:34) sequences of
C(1,15)S/R(144)E, a KGF analog having substitutions of
serine for cysteine at amino acid positions 1 and 15 and
a substitution of glutamic acid for arginine at amino
acid position 144 of native KGF.

Figure 9 shows the nucleotide (SEQ ID NO:35)
and amino acid (SEQ ID NO:36) sequences of
C(1,15)S/R(144)Q, a KGF analog having substitutions of
serine for cysteine at amino acid positions 1 and 15 and
a substitution of glutamine for arginine at amino acid
position 144 of native KGF.

Figure 10 shows the nucleotide (SEQ ID NO:37)
and amino acid (SEQ ID NO:38) sequences of ΔN15, a KGF
analog having a deletion of the first 15 amino acids of
the N-terminus of native KGF.

Figure 11 shows the nucleotide (SEQ ID NO:39)
and amino acid (SEQ ID NO:40) sequences of ΔN23, a KGF
analog having a deletion of the first 23 amino acids of
the N-terminus of native KGF.

Figure 12 shows the nucleotide (SEQ ID NO:41)
and amino acid (SEQ ID NO:42) sequences of ΔN23/R(144)Q,
a KGF analog having a deletion of the first 23 amino
acids of the N-terminus and a substitution of glutamine
for arginine at amino acid position 144 of native KGF.

Description of Specific Embodiments 

Standard methods for many of the procedures
described in the following examples, or suitable
alternative procedures, are provided in widely
recognized manuals of molecular biology such as, for
example, Molecular Cloning, Second Edition, Sambrook et
al., Cold Spring Harbor Laboratory Press (1987) and
Current Protocols in Molecular Biology, Ausubel et al.,
Greene Publishing Associates/Wiley-Interscience, New
York (1990).

Example 1: Preparation of DNA Coding for KGF and KGF Analogs

The cloning of the full-length human KGF gene
(encoding a polypeptide with the sequence of native KGF)
was carried out both by polymerase chain reaction (PCR)
of RNA from an animal cell and by PCR of chemically
synthesized (E. coli optimized codon) oligonucleotides
("OLIGOs"). Both procedures are described below.

PCR amplification using RNA isolated from
cells known to produce the polypeptide was performed.
Initially, cells from a human fibroblast cell line
AG1523A (obtained from Human Genetic Mutant Cell Culture
Repository, Institute For Medical Research, Camden, New
Jersey) were disrupted with guanidinium thiocyanate,
followed by extraction (according to the method of
Chomczynski et al. (1987), Anal. Biochem., 162:156).
Using a standard reverse transcriptase protocol for
total RNA, the KGF cDNA was generated. PCR (PCR#1)
amplification of the KGF gene was carried out using the
KGF cDNA as template and primers OLIGO#1 and OLIGO#2
that encode DNA sequences immediately 5' and 3' of the
KGF gene [model 9600 Thermocycler (Perkin-Elmer Cetus,
Norwalk, CT); 28 cycles; each cycle consisting of one
minute at 94°C for denaturation, two minutes at 60°C for
annealing, and three minutes at 72°C for elongation]. A
small aliquot of the PCR#1 product was then used as
template for a second KGF PCR (PCR#2) amplification
identical to the cycle conditions described above except
for a 50°C annealing temperature. For expression
cloning of the KGF gene, nested PCR primers were used to
create convenient restriction sites at both ends of the
KGF gene. OLIGO#3 and OLIGO#4 were used to modify the
KGF DNA product from PCR#2 to include MluI and BamHI
restriction sites at the 5' and 3' ends of the gene,
respectively [PCR#3; 30 cycles; each cycle consisting of
one minute at 94°C for denaturation, two minutes at 60°C
for annealing, and three minutes at 72°C for
elongation]. This DNA was subsequently cut with MluI
and BamHI, phenol extracted and ethanol precipitated.
It was then resuspended and ligated (using T4 ligase)
into a pCFM1156 plasmid (Figure 2A) that contained a
"RSH" signal sequence to make construct RSH-KGF (Figure
3). The ligation products were transformed (according
to the method of Hanahan (1983), J. Mol. Biol., 166:557)
into E. coli strain FM5 (ATCC: 53911) and plated onto
LB+kanamycin at 28°C. Several transformants were
selected and grown in small liquid cultures containing
20 µg/mL kanamycin. The RSH-KGF plasmid was isolated
from the cells of each culture and DNA sequenced.
Because of an internal NdeI site in the KGF gene, it was
not possible to directly clone the native gene sequence
into the desired expression vector with the bracketed
restriction sites of NdeI and BamHI. Cloning was instead
accomplished as a three-way ligation. Plasmid RSH-KGF
was cut with the unique restriction sites of BsmI and
SstI, and a ~3 kbp DNA fragment (containing the 3' end
of the KGF gene) was isolated following electrophoresis
through a 1% agarose gel. A PCR (PCR#4) was carried out
as described for PCR#3 except for the substitution of
OLIGO#5 for OLIGO#3. The PCR DNA product was then cut
with NdeI and BsmI and a 311 bp DNA fragment was
isolated following electrophoresis through a 4% agarose
gel. The third fragment used in the ligation was a 1.8
kbp DNA fragment of pCFM1156 cut with NdeI and SstI,
isolated following electrophoresis through a 1% agarose
gel. Following ligation (T4 ligase), transformation,
kanamycin selection and DNA sequencing, as described
above, a clone was picked containing the construct in
Figure 4, and the plasmid designated KGF. Because of an
internal ribosomal binding site that produced truncated
products, the KGF DNA sequence between the unique KpnI
and EcoRI sites was replaced with chemically synthesized
OLIGOs (OLIGO#6 through OLIGO#11) to minimize the use of
the internal start site (Figure 5).
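
The thermocycling programs described above for PCR#1 through PCR#4 can be written down compactly, which also makes their durations easy to compare. The sketch below is illustrative only: the `CyclingProgram` class is a hypothetical helper, the step times and temperatures are those quoted in the text, and ramp times between temperatures are ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CyclingProgram:
    name: str
    cycles: int
    anneal_c: int
    denature_c: int = 94
    extend_c: int = 72
    denature_s: int = 60   # one minute of denaturation
    anneal_s: int = 120    # two minutes of annealing
    extend_s: int = 180    # three minutes of elongation

    def cycling_minutes(self) -> float:
        """Total hold time over all cycles, excluding temperature ramps."""
        per_cycle_s = self.denature_s + self.anneal_s + self.extend_s
        return self.cycles * per_cycle_s / 60


PROGRAMS = [
    CyclingProgram("PCR#1", cycles=28, anneal_c=60),
    CyclingProgram("PCR#2", cycles=28, anneal_c=50),  # as PCR#1, 50°C annealing
    CyclingProgram("PCR#3", cycles=30, anneal_c=60),
    CyclingProgram("PCR#4", cycles=30, anneal_c=60),  # as PCR#3, OLIGO#5 for OLIGO#3
]

if __name__ == "__main__":
    for program in PROGRAMS:
        print(f"{program.name}: {program.cycles} cycles, "
              f"anneal {program.anneal_c}°C, "
              f"{program.cycling_minutes():.0f} min of hold time")
```

Each cycle holds for six minutes in total, so the 28-cycle and 30-cycle programs correspond to 168 and 180 minutes of hold time, respectively.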



OLIGO#1 (SEQ ID NO:7): 5'-CAATGACCTAGGAGTAACAATCAAC-3'
OLIGO#2 (SEQ ID NO:8): 5'-AAAACAAACATAAATGCACAAGTCCA-3'
OLIGO#3 (SEQ ID NO:9): 5'-ACAACGCGTGCAATGACATGACTCCA-3'
OLIGO#4 (SEQ ID NO:10):
5'-ACAGGATCCTATTAAGTTATTGCCATAGGAA-3'
OLIGO#5 (SEQ ID NO:11):
5'-ACACATATGTGCAATGACATGACTCCA-3'
OLIGO#6 (SEQ ID NO:12):
5'-CTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCC-3'
OLIGO#7 (SEQ ID NO:13):
5'-AAGAGATGAAAAACAACTACAATATTATGGAAATCCGTACTGTT-3'
OLIGO#8 (SEQ ID NO:14):
5'-GCTGTTGGTATCGTTGCAATCAAAGGTGTTGAATCTG-3'
OLIGO#9 (SEQ ID NO:15):
5'-TCTTGGGTGCCCTTGACTTTGCCGCGTTTGTCGATACGCAGGTAC-3'
OLIGO#10 (SEQ ID NO:16):
5'-ACAGCAACAGTACGGATTTCCATAATATTGTAGTTGTTTTTCATC-3'
OLIGO#11 (SEQ ID NO:17):
5'-AATTCAGATTCAACACCTTTGATTGCAACGATACCA-3'

The OLIGOs were phosphorylated with T4
polynucleotide kinase and then heat denatured. The
single-stranded (ss) OLIGOs were then allowed to form a
ds DNA fragment by allowing the temperature to slowly
decrease to room temperature. T4 ligase was then used
to covalently link both the internal OLIGO sticky-ends
and the whole ds OLIGO fragment to the KGF plasmid cut
with KpnI and EcoRI. The new plasmid was designated
KGF(dsd).
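
The annealing step described above relies on OLIGO#6 through OLIGO#8 (one strand) being complementary to OLIGO#9 through OLIGO#11 (the other strand), with single-stranded ends that match KpnI- and EcoRI-cut vector. The following sketch checks that property for the sequences as listed in this description. It is an illustration rather than part of the disclosed method; the variable and function names are hypothetical, and OLIGO#10 is taken with a T at the one position that is illegible in some copies of the listing.

```python
# Translation table for Watson-Crick complements of A, C, G and T.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return sequence.translate(COMPLEMENT)[::-1]


OLIGO_6 = "CTGCGTATCGACAAACGCGGCAAAGTCAAGGGCACCC"
OLIGO_7 = "AAGAGATGAAAAACAACTACAATATTATGGAAATCCGTACTGTT"
OLIGO_8 = "GCTGTTGGTATCGTTGCAATCAAAGGTGTTGAATCTG"
OLIGO_9 = "TCTTGGGTGCCCTTGACTTTGCCGCGTTTGTCGATACGCAGGTAC"
OLIGO_10 = "ACAGCAACAGTACGGATTTCCATAATATTGTAGTTGTTTTTCATC"
OLIGO_11 = "AATTCAGATTCAACACCTTTGATTGCAACGATACCA"

# One strand reads OLIGO#6-#7-#8; the other reads OLIGO#11-#10-#9 (5'->3').
TOP_STRAND = OLIGO_6 + OLIGO_7 + OLIGO_8
BOTTOM_STRAND = OLIGO_11 + OLIGO_10 + OLIGO_9

# Written in top-strand orientation, the bottom strand should equal the top
# strand flanked by the overhangs: GTAC (3' overhang left by KpnI, GGTAC^C)
# and AATT (5' overhang left by EcoRI, G^AATTC).
EXPECTED = "GTAC" + TOP_STRAND + "AATT"

if __name__ == "__main__":
    print("duplex consistent:", reverse_complement(BOTTOM_STRAND) == EXPECTED)
    # Junctions on the two strands are offset, so each nick is bridged by
    # an intact oligo on the opposite strand.
    top_nicks = (len(OLIGO_6), len(OLIGO_6 + OLIGO_7))
    bottom_nicks = (len(OLIGO_9) - 4, len(OLIGO_9 + OLIGO_10) - 4)
    print("top-strand junctions:", top_nicks)
    print("bottom-strand junctions (top coordinates):", bottom_nicks)
```

The staggered junctions are what allow the six oligos to hold one another in register during annealing, so that T4 ligase can seal the internal nicks and join the fragment to the cut plasmid in a single reaction.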

A completely E. coli codon-optimized KGF gene
was constructed by PCR amplification of chemically
synthesized OLIGOs #12 through 24.

OLIGO#12 (SEQ ID NO:18): 5'-ACTTTTGATCTAGAAGGAGG-3'
OLIGO#13 (SEQ ID NO:19): 5'-TCAAAACTGGATCCTATTAA-3'
OLIGO#14 (SEQ ID NO:20):
5'-AGTTTTGATCTAGAAGGAGGAATAACATATGTGCAACGACATGAC-
TCCGGAACAGATGGCTACCAACGTTAACTGCTCCAGCCCGGAACGT-3'
OLIGO#15 (SEQ ID NO:21):
5'-cacacccgtagctacgactacatggaaggtggtgacatccgtgttc-
gtcctctcttctccgtaccc^^
OLIGO#16 (SEQ ID NO:22):
5'-CGTGGTAAAGTTAAAGGTACCCAGGAAATGAAAAACAACTA-
CAACATCATGGAAATCCGTACTCTTGCTGTTGGTATC-3'
OLIGO#17 (SEQ ID NO:23):
5'-GGTGTTGAATCTGAATTCTACCTGGCAATGAACAAAGAAGGTAAAC-
TGTACGCAAAAAAAGAATGCAACGAAGACTGCAACTTCAAAGAA-3'
OLIGO#18 (SEQ ID NO:24):
5'-CTGATCCTGGAAAACCACTACAACACCTACGCATCTGCTAAATGGA-
CCCACAACGGTGGTGAAATGTTCGTTGCTCTGAACCAGAAAGGT-3'
OLIGO#19 (SEQ ID NO:25):
5'-ATCCCGGTTCGTGGTAAAAAAACCAAAAAAGAACAGAAAACCGCT-
CACTTCCTGCCGATGGCAATCACTTAATAGGATCCAGTTTTGA-3'
OLIGO#20 (SEQ ID NO:26): 5'-TACGGGTGTGACGTTCCGGG-3'
OLIGO#21 (SEQ ID NO:27): 5'-CTTTACCACGTTTGTCGATA-3'
OLIGO#22 (SEQ ID NO:28): 5'-ATTCAACACCTTTGATTGCA-3'
OLIGO#23 (SEQ ID NO:29): 5'-CCAGGATCAGTTCTTTGAAG-3'
OLIGO#24 (SEQ ID NO:30): 5'-GAACCGGGATACCTTTCTGG-3'

OLIGOs #12 through 24 were designed so that
the entire DNA sequence encoding native KGF was
represented by OLIGOs from either the "Watson" or the
"Crick" strand and upon PCR amplification would produce
the desired double-stranded DNA sequence (Figure 6)
[PCR#5, Model 9600 thermocycler (Perkin-Elmer Cetus); 21
cycles, each cycle consisting of 31 seconds at 94°C for
denaturation, 31 seconds at 50°C for annealing, and 31
seconds at 73°C for elongation; following the 21 cycles
the PCR was finished with a final elongation step of 7
minutes]. After PCR amplification, the DNA fragment was
cut with XbaI and BamHI and the 521 bp fragment ligated
into the expression plasmid pCFM1156 cut with the same
enzymes. PCR#5 utilized the outside primers
(100 pmoles/100 µl rxn) OLIGO#12 and OLIGO#13 and
1 µl/100 µl rxn of a KGF template derived by ligation
(by T4 ligase) of OLIGOs #14 through #19 (OLIGOs #15
through #18 were phosphorylated with T4
polynucleotide kinase) using OLIGOs #20 through #24
as band-aid oligos (Jayaraman et al. (1992),
Biotechniques, 12:392) for the ligation. The final
construct was designated KGF(codon optimized).
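
Several of the primers in this Example exist to introduce restriction sites (MluI, BamHI, NdeI, XbaI) at the ends of the amplified gene. As a quick consistency check, the sketch below looks for the expected recognition sequence in each such primer as listed in this description. It is illustrative only: the dictionary and function names are hypothetical, and the recognition sequences used are the standard ones for these enzymes.

```python
# Standard recognition sequences (all palindromic, so either strand matches).
RECOGNITION_SITES = {
    "MluI": "ACGCGT",
    "BamHI": "GGATCC",
    "NdeI": "CATATG",
    "XbaI": "TCTAGA",
}

# Primer name -> (sequence 5'->3' as listed, enzyme whose site it carries).
SITE_PRIMERS = {
    "OLIGO#3": ("ACAACGCGTGCAATGACATGACTCCA", "MluI"),
    "OLIGO#4": ("ACAGGATCCTATTAAGTTATTGCCATAGGAA", "BamHI"),
    "OLIGO#5": ("ACACATATGTGCAATGACATGACTCCA", "NdeI"),
    "OLIGO#12": ("ACTTTTGATCTAGAAGGAGG", "XbaI"),
    "OLIGO#13": ("TCAAAACTGGATCCTATTAA", "BamHI"),
}


def site_offset(sequence: str, enzyme: str) -> int:
    """Return the 0-based offset of the enzyme's site, or -1 if absent."""
    return sequence.find(RECOGNITION_SITES[enzyme])


if __name__ == "__main__":
    for name, (sequence, enzyme) in SITE_PRIMERS.items():
        offset = site_offset(sequence, enzyme)
        status = f"found at offset {offset}" if offset >= 0 else "NOT FOUND"
        print(f"{name}: {enzyme} site {RECOGNITION_SITES[enzyme]} {status}")
```

In each primer the site sits a few bases in from the 5' end, leaving flanking nucleotides for the enzyme to bind when the PCR product is digested.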

All of the KGF analogs described herein are
composed in part from DNA sequences found in KGF(dsd) or
KGF(codon optimized), or a combination of the two. The
sequences are further modified by the insertion into
convenient restriction sites of DNA sequences that
encode the particular KGF analog amino acids made
utilizing one or more of the above-described techniques
for DNA fragment synthesis. Any of the analogs can be
generated in their entirety by either of the
above-described techniques. However, as a part of the general
OLIGO design, optimized E. coli codons were used where
appropriate, although the presence of E. coli optimized
codons in part or in toto of any of the genes, where
examined, did not significantly increase the yield of
protein that could be obtained from cultured bacterial
cells. Figures 7 to 12 set forth by convenient example
particular KGF analog nucleotide and amino acid sequence
constructions: C(1,15)S (Figure 7); C(1,15)S/R(144)E
(Figure 8); C(1,15)S/R(144)Q (Figure 9); ΔN15 (Figure
10); ΔN23 (Figure 11) and ΔN23/R(144)Q (Figure 12). All
the KGF analog constructions described herein were DNA
sequence confirmed.

Example 2: Purification from E. coli

Three different expression plasmids were
utilized in the cloning of the KGF analog genes. They
were pCFM1156 (ATCC 69702), pCFM1656 (ATCC 69576), and
pCFM3102 (Figures 2A, 2B and 2C, respectively). The
plasmid pCFM3102 can be derived from the plasmid pCFM1656
by making a series of site-directed base changes with
PCR overlapping oligo mutagenesis. Starting with the
BglII site (pCFM1656 plasmid bp #180) immediately 5' to
the plasmid replication promoter, PcopB, and proceeding
toward the plasmid replication genes, the base pair
changes are as follows:

pCFM1656 bp #    bp in pCFM1656     bp changed to in pCFM3102

# 204            T/A                C/G
# 428            A/T                G/C
# 509            G/C                A/T
# 617                               insert two G/C bp
# 677            G/C                T/A
# 978
# 992            G/C                A/T
# 1002           A/T                C/G
# 1005                              T/A
# 1026           A/T                T/A
# 1045           C/G                T/A
# 1176           G/C                T/A
# 1464                              T/A
# 2026           G/C                bp deletion
# 2186           C/G                T/A
# 2479           A/T                T/A
# 2498-2501      AGTG/TCAC          GTCA/CAGT
# 2641-2647      TCCGAGC/AGGCTCG    bp deletion
# 3441           G/C                A/T
# 3452           G/C                A/T
# 3649           A/T                T/A
# 4556                              insert bps
                 (SEQ ID NO:44) 5'-CTCGAGTGATCACAGCTGGACGTC-3'



As seen above, pCFM1156, pCFM1656 and pCFM3102
are very similar to each other and contain many of the
same restriction sites. The plasmids were chosen by
convenience, and the vector DNA components can be easily
exchanged for purposes of new constructs. The host used
for all cloning was E. coli strain FM5 (ATCC: 53911) and
the transformations were carried out (according to the
method of Hanahan (1983), supra) or by electroporation
with a Gene Pulser™ transfection apparatus (BioRad
Laboratories, Inc., Hercules, CA), according to the
manufacturer's protocol.

Initially, a small, freshly-cultured inoculum
of the desired recombinant E. coli clone harboring the
desired construct on one of the three pCFM vectors was
started by transferring 0.1 mL of a frozen glycerol
stock of the appropriate strain into a 2 L flask
containing 500 mL of Luria broth. The culture was
shaken at 30°C for 16 hours. Thereafter the culture was
transferred to a 15 L fermentor containing 8 L of
sterile batch medium (Tsai, et al. (1987), J. Industrial
Microbiol., 2:181-187).

Fed-batch fermentation starts with the
feeding of Feed #1 medium (Tsai, et al. (1987),
supra). When the OD600 reached 35, expression of the
desired KGF analog was induced by rapidly raising the
culture temperature to 37°C to allow the amplification
of plasmid. After two hours at 37°C, the culture
temperature was quickly raised to 42°C to denature the
CI repressor, and the addition of Feed 1 was discontinued
in favor of Feed 2, the addition rate of which was
initiated at 300 mL/hr. Feed 2 comprised 175 g/L
trypticase-peptone, 87.5 g/L yeast extract, and 260 g/L
glucose. After one hour at 42°C, the culture
temperature was decreased to 36°C, where this
temperature was then maintained for another 6 hours.
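
The induction schedule and Feed 2 composition described above lend themselves to a short worked calculation. The sketch below tabulates the temperature phases and the mass of each Feed 2 component delivered per hour at the initial addition rate. It is illustrative only: the names are hypothetical, and it covers only the initial 300 mL/hr rate, since the text does not state how the rate changes afterwards.

```python
# (phase description, temperature in °C, duration in hours), from the text.
INDUCTION_PHASES = [
    ("plasmid amplification", 37, 2.0),
    ("CI repressor denaturation", 42, 1.0),
    ("expression hold", 36, 6.0),
]

# Feed 2 composition in grams per liter, from the text.
FEED_2_G_PER_L = {
    "trypticase-peptone": 175.0,
    "yeast extract": 87.5,
    "glucose": 260.0,
}

FEED_2_INITIAL_RATE_ML_PER_H = 300


def total_induction_hours(phases) -> float:
    """Sum the durations of all post-induction temperature phases."""
    return sum(hours for _, _, hours in phases)


def initial_delivery_g_per_h(component: str) -> float:
    """Grams of a Feed 2 component delivered per hour at the initial rate."""
    return FEED_2_G_PER_L[component] * FEED_2_INITIAL_RATE_ML_PER_H / 1000


if __name__ == "__main__":
    print("hours from induction to harvest:",
          total_induction_hours(INDUCTION_PHASES))
    for component in FEED_2_G_PER_L:
        print(f"{component}: {initial_delivery_g_per_h(component):.2f} g/hr")
```

On these figures the culture spends nine hours between induction and harvest, and Feed 2 initially supplies 78 g of glucose per hour.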

The fermentation was then halted and the cells
were harvested by centrifugation into plastic bags
placed within 1 L centrifuge bottles. The cells were
pelleted by centrifugation at 400 rpm for 60 minutes,
after which the supernatants were removed and the cell
paste frozen at -90°C.

Following expression of the various KGF analogs in E. coli, native
KGF, C(1,15)S, C(1,15)S/R(144)E, C(1,15)S/R(144)Q, ΔN15, ΔN23, and
ΔN23/R(144)Q protein were purified using the following procedure.
Cell paste from a high cell density fermentation was suspended at
4°C in 0.2 M NaCl, 20 mM NaPO4, pH 7.5 as a 10-20% solution (weight
per volume) using a suitable high shear mixer. The suspended cells
were then lysed by passing the solution through a homogenizer (APV
Gaulin, Inc., Everett, MA) three times. The outflowing homogenate
was cooled to 4-8°C by using a suitable heat exchanger. Debris was
then removed by centrifuging the lysate in a J-6B™ centrifuge
(Beckman Instruments, Inc., Brea, CA) equipped with a JS 4.2 rotor
at 4,200 rpm for 30-60 min. at 4°C. Supernatants were then carefully
decanted and loaded onto a previously prepared 450 mL (5 cm x 23 cm)
column of S-Sepharose Fast Flow™ resin (Pharmacia) equilibrated
with 0.2 M NaCl, 20 mM NaPO4, pH 7.5 at 4°C. Next, the column was
washed with five column volumes (2250 mL) of 0.4 M NaCl, 20 mM
NaPO4, pH 7.5 at 4°C. The desired protein was eluted by washing the
column with 5 L of 0.5 M NaCl, 20 mM NaPO4, pH 7.5. Again, 50 mL
fractions were collected and the A280 of the effluent was
continuously monitored. Fractions identified by A280 as containing
eluted material were then analyzed by SDS-PAGE through 14% gels to
confirm the presence of the desired polypeptide.
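
As a quick consistency check on the column dimensions and wash volume quoted above, the following sketch computes the geometric volume of a 5 cm x 23 cm cylindrical bed and the volume corresponding to five nominal column volumes. The function names are illustrative only and are not part of the described procedure; the sole assumption is that the resin bed is an ideal cylinder.

```python
import math


def bed_volume_ml(diameter_cm: float, height_cm: float) -> float:
    """Geometric volume of a cylindrical resin bed in mL (1 cm^3 = 1 mL)."""
    radius_cm = diameter_cm / 2.0
    return math.pi * radius_cm ** 2 * height_cm


def wash_volume_ml(nominal_bed_ml: float, column_volumes: float) -> float:
    """Volume of buffer corresponding to a number of nominal column volumes."""
    return nominal_bed_ml * column_volumes


# Dimensions and nominal bed volume as stated in the text.
geometric_ml = bed_volume_ml(diameter_cm=5.0, height_cm=23.0)
five_cv_ml = wash_volume_ml(nominal_bed_ml=450.0, column_volumes=5)

print(f"geometric bed volume: {geometric_ml:.1f} mL")
print(f"five column volumes:  {five_cv_ml:.0f} mL")
```

The geometric volume agrees with the nominal 450 mL to within about 2 mL, and five nominal column volumes reproduce the 2250 mL wash stated in the text.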

Those fractions containing proteins of interest were then pooled,
followed by the addition of an equal volume of distilled water. The
diluted sample was then loaded onto a previously prepared 450 mL
(5 cm x 23 cm) column of S-Sepharose Fast Flow equilibrated with
0.4 M NaCl, 20 mM NaPO4, pH 6.8 at 4°C. The column was washed with
2250 mL of 0.4 M NaCl, 20 mM NaPO4, pH 6.8 and the protein eluted
using a 20 column volume linear gradient ranging from 0.4 M NaCl,
20 mM NaPO4, pH 6.8 to 0.6 M NaCl, 20 mM NaPO4, pH 6.8. Again, 50 mL
fractions were collected under constant A280 monitoring of the
effluent. Those fractions containing the protein (determined by 14%
SDS-PAGE) were then pooled, followed by concentration through a
YM-10 membrane (10,000 molecular weight cutoff) in a 350cc stirring
cell (Amicon, Inc., Mayberry, MA) to a volume of 30-40 mL.

The concentrate was then loaded onto a previously generated 1,300 mL
(4.4 cm x 85 cm) column of Superdex-75™ resin (Pharmacia)
equilibrated in column buffer comprising 1X PBS (Dulbecco's
Phosphate Buffered Saline, "D-PBS," calcium and magnesium-free) or
0.15 M NaCl, 20 mM NaPO4, pH 7.0. After allowing the sample to run
into the column, the protein was eluted from the gel filtration
matrix using column buffer. Thereafter, 10 mL fractions were
recovered and those containing the analog (determined by 14%
SDS-PAGE) were pooled. Typically, the protein concentration was
about 5-10 mg/mL in the resultant pool. All of the above procedures
were performed at 4-8°C unless otherwise specified.

An alternative purification procedure was used to purify native
KGF, C(1,15)S and ΔN23. The procedure involves the following steps
and, unless otherwise specified, all procedures, solutions and
materials were conducted at 23 ± 5°C.

Upon completion of the production phase of a bacterial
fermentation, the cell culture was cooled to 4-8°C and the cells
were harvested by centrifugation or a similar process. On the basis
of the expected yield of protein per unit weight of cell paste and
the amount of purified protein required, an appropriate amount of
cell paste, by weight, was suspended in a mild buffer solution,
20 mM NaPO4, 0.2 M NaCl, pH 7.5, weighing about five times that of
the cell paste to be suspended. The cells were dispersed to a
homogeneous solution using a high shear mixer. The temperature of
the cell paste dispersion was maintained at 4-8°C during
homogenization.

The cells were then lysed by pressure, for example by passing the
cell paste dispersion twice through an appropriately sized cell
homogenizer. The homogenate was kept chilled at 5 ± 3°C. To clarify
the cell lysate, a previously prepared depth filter housing (Cuno,
Inc., Meriden, CT) equipped with a filter having an appropriate
amount of filter surface area, equilibrated with a suitable volume
of 0.2 M NaCl, 20 mM NaPO4, pH 7.5, was employed. The equilibration
and clarification were performed at 5 ± 3°C. Prior to
clarification, an appropriate amount of a suitable filter aid was
used to pre-coat the filter and was thoroughly mixed with the cell
lysate, after which the lysate was clarified by passing the
solution through the filter apparatus. The filter was washed with
0.2 M NaCl, 20 mM NaPO4, pH 7.5. The filtrate and any subsequent
wash were collected in a chilled container of suitable capacity,
all the while being maintained at less than 10°C.

Following clarification, the lysate was then passed through a
previously prepared column of SP-Sepharose Fast Flow containing at
least 1 mL of resin per 2 g of cell paste. The column of
SP-Sepharose Fast Flow was equilibrated with cold (5 ± 3°C) 0.2 M
NaCl, 20 mM NaPO4, pH 7.5. The temperature of the column was
maintained at less than 10°C. The clarified lysate (5 ± 3°C) was
then loaded onto the ion exchange column, with the absorbance at
280 nm (A280) of eluate being continuously monitored. After sample
loading, the column was washed with cold 0.2 M NaCl, 20 mM NaPO4,
pH 7.5, followed by washing with 0.3 M NaCl, 20 mM NaPO4, pH 7.5 at
23 ± 5°C.
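
The two scale rules stated in the alternative procedure (suspension buffer weighing about five times the cell paste, and at least 1 mL of resin per 2 g of cell paste) can be turned into a small planning helper. This is a minimal sketch: the `ScalePlan` type and `plan_scale` function are hypothetical names, and the factor of five is treated as exact although the text says "about".

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScalePlan:
    cell_paste_g: float
    suspension_buffer_g: float  # nominal: about 5x the cell paste weight
    min_resin_ml: float         # at least 1 mL resin per 2 g cell paste


def plan_scale(cell_paste_g: float) -> ScalePlan:
    """Derive nominal buffer mass and minimum resin volume from paste mass."""
    if cell_paste_g <= 0:
        raise ValueError("cell paste mass must be positive")
    return ScalePlan(
        cell_paste_g=cell_paste_g,
        suspension_buffer_g=5.0 * cell_paste_g,
        min_resin_ml=cell_paste_g / 2.0,
    )


# Example: 1 kg of cell paste (an arbitrary illustrative amount).
plan = plan_scale(1000.0)
print(plan)
```

For 1 kg of cell paste this gives roughly 5 kg of suspension buffer and a column of no less than 500 mL of SP-Sepharose Fast Flow.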

To elute the desired protein, a linear gradient ranging from
0.2-1 M NaCl, 20 mM NaPO4, pH 7.5 was used. Bulk product was
collected in several fractions on the basis of the A280 of the
eluate. Following elution, these fractions were pooled and the
volume noted.

To oxidize free sulfhydryl groups, an oxidation step was
performed. For proteins with altered cysteine patterns, as compared
to native KGF, an oxidizing agent (e.g., cystamine dihydrochloride
or another appropriate oxidizing agent, for instance, cystine,
oxidized glutathione or divalent copper) was added to a final
concentration of 1-20 mM and the pH was adjusted to 7-9.5, with a
pH of 9.0 ± 0.3 when cystamine dihydrochloride was used. The
oxidation was conducted at 10-30°C for an appropriate period. For
the native KGF protein, oxidation was accomplished by adding an
appropriate amount of (NH4)2SO4 such as 1-2 M (NH4)2SO4, adjusting
the pH to 7.5-9.5, and holding the temperature at 23 ± 5°C for an
appropriate period.

After oxidation, the pH of the solution was adjusted to between 6.5
and 9.5. If necessary, solid (NH4)2SO4 was added to the solution to
a final concentration of 2 M. To remove particulates, the solution
was passed through appropriate clarification filters.

The filtered, oxidized product was then subjected to hydrophobic
interaction chromatography (HIC). The HIC matrix was Butyl-650M
Toyopearl™ resin (Tosohaas, Inc., Montgomeryville, PA). The
protein-containing solution was loaded onto the column, which had
been previously equilibrated with 2 M (NH4)2SO4, 0.15 M NaCl, 20 mM
NaPO4, pH 7.0. After sample loading, the column was washed with 2 M
(NH4)2SO4, 0.15 M NaCl, 20 mM NaPO4, pH 7.0. The desired protein was
then eluted using a decreasing linear (NH4)2SO4 gradient ranging
from 2-0 M developed in 0.15 M NaCl, 20 mM NaPO4, pH 7.0. When the
desired protein began to elute, as indicated by an increase in the
A280 of the eluate, fractions were collected. Aliquots of each
fraction were then analyzed by SDS-PAGE. Those fractions containing
the desired protein were then pooled, thoroughly mixed, and the
volume of the pool determined, as was the concentration of the
protein therein.

The pooled HIC protein-containing eluate was then concentrated and
the elution buffer exchanged. Typically, proteins were concentrated
to 5.0-10.0 mg/mL. Ultrafiltration was conducted using an
ultrafiltration system equipped with a Pellicon™ cassette system
(Millipore, Inc., Bedford, MA) with an appropriately sized cut-off
membrane.

After concentration, the sample was diafiltered against an
appropriate buffer. The retentate from the concentration step was
diafiltered against 0.15 M NaCl, 20 mM NaPO4, pH 7.0 until the
conductivity of the retentate was within 5% of the conductivity of
the 0.15 M NaCl, 20 mM NaPO4, pH 7.0 solution.
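
The diafiltration endpoint described above is a simple relative-tolerance test on conductivity. A minimal sketch follows, assuming that "within 5%" is measured relative to the conductivity of the diafiltration buffer; the function name and the conductivity readings are hypothetical and are not taken from the procedure.

```python
def diafiltration_complete(
    retentate_ms_cm: float,
    buffer_ms_cm: float,
    tolerance: float = 0.05,
) -> bool:
    """True when retentate conductivity is within `tolerance` of the buffer.

    Assumption: the 5% window is relative to the buffer conductivity.
    """
    if buffer_ms_cm <= 0:
        raise ValueError("buffer conductivity must be positive")
    return abs(retentate_ms_cm - buffer_ms_cm) <= tolerance * buffer_ms_cm


# Hypothetical readings in mS/cm, for illustration only.
buffer_reading = 17.0
for retentate_reading in (21.0, 18.5, 17.5):
    done = diafiltration_complete(retentate_reading, buffer_reading)
    print(f"retentate {retentate_reading} mS/cm -> complete: {done}")
```

In practice the check would be repeated after each diafiltration volume until it first returns true.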

In addition, to remove precipitates and bacterial endotoxin that
might be present, the concentrated diafiltered protein-containing
sample was passed through a 0.1 µm Posidyne™ filter (Pall, Inc.,
Cortland, NY). After determining the protein concentration of the
solution and on the basis of the desired concentration of the
final bulk product, the solution was diluted with 0.15 M NaCl,
20 mM sodium phosphate, pH 7.0, to the desired final concentration.
A final aseptic filtration through a 0.22 µm filter was then
performed as the final bulk product was transferred to a
pyrogen-free container for storage (at about 5°C) for further
formulation.

Example 3: Purification from Mammalian Cell Culture

This example describes the expression, 
isolation, and characterization of two biologically 
active recombinant KGF (rKGF) forms produced in a 
mammalian expression system. 
The human KGF gene was isolated by PCR amplification of cDNA made
from normal dermal human fibroblast cells (Clonetec, Inc., Palo
Alto, CA). Following the making of cDNA by reverse transcriptase,
PCR was used to amplify the KGF gene. OLIGO#25 and OLIGO#26 were
used to amplify the gene out of the cDNA, and OLIGO#27 and OLIGO#28
were used to place HindIII and BglII restriction sites at the
fragment ends by a second PCR amplification, as set forth in
Figure 1.



OLIGO#25 (SEQ ID NO:45): 5'-CAATCTACAATTCACAGA-3'

OLIGO#26 (SEQ ID NO:46): 5'-TTAAGTTATTGCCATAGG-3'

OLIGO#27 (SEQ ID NO:47): 5'-AACAAAGCTTCTACAATTCACAGATAGGA-3'

OLIGO#28 (SEQ ID NO:48): 5'-AACAAGATCTTAAGTTATTGCCATAGG-3'



Following cloning and DNA sequence confirmation, the KGF gene DNA
was then used. Amplification was effected using two primers:

OLIGO#29 (SEQ ID NO:49):
5'-CGGTCTAGACCACCATGCACAAATGGATACTGACATGG-3'

OLIGO#30 (SEQ ID NO:50):
5'-GCCGTCGACCTATTAAGTTATTGCCATAGGAAG-3'

The sense primer, OLIGO#29, included an XbaI site and a consensus
Kozak translation sequence (5'-CCACC-3') upstream of the start
codon, ATG. The antisense primer, OLIGO#30, included a SalI cloning
site and an additional stop codon. After 18 cycles of PCR
amplification (30 sec. denaturation at 94°C, 40 sec. annealing at
55°C, and 40 sec. elongation at 72°C), the product was digested with
XbaI and SalI and ligated with a similarly digested DNA of pDSRα2
(according to the methods of Bourdrel et al. (1993), Protein Exp. &
Purif., 1:130-140 and Lu et al. (1992), Arch. Biochem. Biophys.,
224:150-158). This resulted in plasmid KGF/pDSRα2, which placed the
human KGF gene between the SV40 early promoter and the α-FSH
polyadenylation sequences. Two clones were picked and DNA sequence
analysis confirmed construction of the desired vector.
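
The primer features described above can be checked directly against the printed OLIGO#29 and OLIGO#30 sequences. The sketch below is illustrative only (the helper names are not from the source); it locates the XbaI site and the Kozak sequence plus start codon in the sense primer, and reads the two codons immediately upstream of the SalI site on the sense strand of the antisense primer.

```python
XBAI_SITE = "TCTAGA"
SALI_SITE = "GTCGAC"
KOZAK_PLUS_START = "CCACCATG"  # consensus Kozak sequence followed by ATG

# Primer sequences as printed in the text (5' to 3').
OLIGO_29 = "CGGTCTAGACCACCATGCACAAATGGATACTGACATGG"
OLIGO_30 = "GCCGTCGACCTATTAAGTTATTGCCATAGGAAG"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def codons_upstream_of(sense: str, site: str) -> list[str]:
    """Return the two codons (six bases) immediately 5' of `site`."""
    pos = sense.find(site)
    if pos < 6:
        return []
    upstream = sense[pos - 6:pos]
    return [upstream[:3], upstream[3:]]


xbai_pos = OLIGO_29.find(XBAI_SITE)
kozak_pos = OLIGO_29.find(KOZAK_PLUS_START)
sense_of_30 = reverse_complement(OLIGO_30)
stops = codons_upstream_of(sense_of_30, SALI_SITE)

print("XbaI site at index", xbai_pos, "; Kozak+ATG at index", kozak_pos)
print("sense strand of OLIGO#30:", sense_of_30)
print("codons before SalI site:", stops)
```

On the sense strand, the coding sequence ends with the native TAA stop, followed by an additional TAG stop and then the SalI site, which matches the description of the antisense primer.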

Two micrograms of KGF/pDSRα2 DNA were then linearized with PvuI.
Chinese hamster ovary (CHO) cells, seeded the day before at
0.8 x 10^6 cells/60 mm culture dish, were then transfected with the
treated DNA using a standard calcium phosphate precipitation method
(Bourdrel et al., supra). Two weeks later, individual colonies were
picked and transferred into 24-well plates. The conditioned media
was considered serum-free when the cells reached confluency, and
aliquots thereof were analyzed by Western blotting using a
polyclonal rabbit antiserum reactive against E. coli-expressed
human KGF.

Westerns were performed by running samples through 12.5% (w/v) SDS
polyacrylamide gels, followed by electroblotting for 1 hr. at
400 mA onto nitrocellulose membranes using a semi-dry transfer
apparatus (Hoefer Scientific Instruments, San Francisco, CA).
20 mM Tris, 150 mM glycine, 20% methanol served as the transfer
buffer. The nitrocellulose sheets were blocked by incubation with
10% normal goat serum in PBS. Rabbit anti-serum raised against
E. coli-derived KGF was used as primary antibody. For use, it was
diluted 1/10,000 in 1% normal goat serum in PBS and incubated with
the blocked nitrocellulose sheets for 12 hr. at room temperature,
after which excess antibody was removed by three 30 min. washes in
PBS. The nitrocellulose membranes were then incubated in 100 mL of
1% normal goat serum in PBS containing Vectastain™ biotinylated
goat anti-rabbit IgG (secondary antibody, Vector Labs, Burlingame,
CA) for 30 minutes at room temperature. After three 10 minute
washes in PBS, a 30 minute room temperature incubation was
performed in a 100 mL solution of 1% normal goat serum containing
streptavidin and biotinylated peroxidase, prepared according to
manufacturer's directions (Vector Labs). Following three washes in
PBS, KGF cross-reactive material was visualized by incubation in a
mixture of 60 µL of 30% (w/v) H2O2 in 100 mL of PBS and 50 mg of
4-chloronaphthol in 20 mL of methanol. The reaction was stopped by
rinsing in water after 10 minutes.

Analysis of the blots revealed that the KGF-specific antibody
associated with three distinct protein bands, two being closely
related with molecular weights of about 25-29 kDa and one with an
estimated molecular weight of about 17 kDa, as compared to the
expected molecular weight of approximately 18.8 kDa of the 163
amino acid mature protein. Additionally, several high expressing
clones secreting more than 2.0 mg of rKGF per liter, as judged by
Western analysis, were selected and expanded into roller bottles
(according to the method of Lu et al., supra) to generate large
volumes of serum-free conditioned medium for purification of KGF by
cationic exchange chromatography and gel filtration, as set forth
below.

KGF from 3 L of serum-free conditioned medium was purified by
applying the medium directly to a cation exchange column
(5 x 24 cm) packed with 450 mL of sulfoethyl SP-Sepharose Fast Flow
resin (Pharmacia) pre-equilibrated with 20 mM sodium phosphate,
pH 7.5. After washing with five column volumes of 20 mM sodium
phosphate, 0.2 M NaCl, pH 7.5, rKGF was eluted using a 20 column
volume linear gradient of 0.2 to 1.0 M NaCl in 20 mM sodium
phosphate, pH 7.5. 50 mL fractions were collected with continuous
A280 monitoring. KGF protein was detected by analyzing aliquots of
each fraction by SDS-PAGE. SDS-PAGE was performed on an
electrophoresis system (Novex, San Diego, CA) using precast 14%
Tris-glycine gels (according to the method of Laemmli (1970),
Nature, 227:680-685). Samples were mixed with non-reducing SDS
sample buffer without heating before loading. The proteins were
detected by either Coomassie blue or silver staining. Two
late-eluting peaks were seen to contain protein bands corresponding
to the 25-29 kDa and 17 kDa bands detected by Western blot. The
fractions containing each of these peaks were separately
concentrated to a volume of less than 1.0 mL and subjected to gel
filtration.
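
To relate fraction numbers to salt concentration in the cation exchange step above, the linear gradient can be modeled as a straight line in delivered volume. This is a minimal sketch under stated assumptions: an ideal linear gradient with no mixing or dead volume, fraction 1 starting at the beginning of the gradient, and helper names that are purely illustrative.

```python
COLUMN_VOLUME_ML = 450.0        # packed bed volume stated in the text
GRADIENT_COLUMN_VOLUMES = 20    # gradient length stated in the text
START_M, END_M = 0.2, 1.0       # NaCl limits of the gradient
FRACTION_ML = 50.0              # fraction size stated in the text

GRADIENT_ML = COLUMN_VOLUME_ML * GRADIENT_COLUMN_VOLUMES


def nacl_molar(delivered_ml: float) -> float:
    """Nominal NaCl molarity after `delivered_ml` of gradient is pumped."""
    if not 0.0 <= delivered_ml <= GRADIENT_ML:
        raise ValueError("volume lies outside the gradient")
    return START_M + (END_M - START_M) * delivered_ml / GRADIENT_ML


def fraction_midpoint_molar(fraction_number: int) -> float:
    """Nominal NaCl molarity at the midpoint of a 1-based fraction."""
    midpoint_ml = (fraction_number - 0.5) * FRACTION_ML
    return nacl_molar(midpoint_ml)


n_fractions = int(GRADIENT_ML // FRACTION_ML)
print(f"gradient volume: {GRADIENT_ML:.0f} mL in {n_fractions} fractions")
print(f"NaCl at gradient midpoint: {nacl_molar(GRADIENT_ML / 2):.2f} M")
```

Under these assumptions the gradient spans 9 L, collected as 180 fractions of 50 mL, with each fraction covering a nominal increment of about 4.4 mM NaCl.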

The gel filtrations employed columns of Superdex-75™ resin
(HR 10/30, Pharmacia) pre-equilibrated with PBS, pH 7.2, and
calibrated with the following known molecular weight standards
(BioRad, San Francisco, CA): thyroglobulin (670 kDa), gammaglobulin
(158 kDa), ovalbumin (44 kDa), myoglobin (17 kDa) and vitamin B-12
(1.4 kDa). These purification steps resulted in an approximate
2000-fold purification of rKGF, specifically including a 17 kDa and
a 30 kDa material, as estimated by silver staining.

In the instance of the higher molecular weight material, rKGF
eluted as a major symmetrical peak, which was called KGF-a. Upon
SDS-PAGE analysis of a lesser amount of this material, 3 µg/lane
versus 6 µg/lane, two bands with a 1-2 kDa molecular weight
difference were resolved. In the instance of the lower molecular
weight material, termed KGF-b, gel filtration resulted in a protein
preparation having the expected mobility. For both KGF-a and KGF-b,
the overall yield after purification was approximately 30-40%.

Amino acid sequences from KGF-a and KGF-b were also analyzed. These
analyses were performed on an automatic sequencer (Model 477A or
470A, Applied Biosystems, Inc., Foster City, CA) equipped with a
Model 120A on-line PTH-amino acid analyzer and a Model 900A data
collection system (according to the method of Lu et al. (1991),
J. Biol. Chem., 266:8102-8107). Edman sequence analysis of KGF-a
revealed a major N-terminal sequence of
X1-N-D-M-T-P-E-Q-M-A-T-N-V-X2-X3-S- (SEQ ID NO:51). A minor
sequence starting from the third N-terminal amino acid, aspartic
acid, was also present in 1.6% of the total sequenceable protein.
X1, X2, and X3 were unassigned due to the absence of
phenylthiohydantoinyl (PTH) amino acid signals during sequence
analysis.

Interestingly, N-terminal sequence analysis of KGF-b revealed an
N-terminal amino acid sequence of S-Y-D-Y-M-E-G-G-D-I-R-V-
(SEQ ID NO:52), indicating that it is an N-terminally truncated
form of KGF that has been proteolytically cleaved at the
Arg23-Ser24 peptide bond.
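
The assignment of the KGF-b cleavage site can be reproduced from the precursor sequence given as SEQ ID NO:2. The sketch below is illustrative only: it uses the first 68 residues of SEQ ID NO:2 in one-letter code, and it takes the signal peptide to be 31 residues, a value inferred here from the 194-residue precursor and the 163-residue mature protein rather than stated outright in the text.

```python
# First 68 residues of SEQ ID NO:2 (precursor), one-letter code.
PRECURSOR_N_TERMINUS = (
    "MHKWILTWILPTLLYR"
    "SCFHIICLVGTISLAC"
    "NDMTPEQMATNVNCSS"
    "PERHTRSYDYMEGGDI"
    "RVRR"
)
SIGNAL_PEPTIDE_LENGTH = 194 - 163    # inferred: precursor minus mature length
KGF_B_N_TERMINUS = "SYDYMEGGDIRV"    # SEQ ID NO:52

mature = PRECURSOR_N_TERMINUS[SIGNAL_PEPTIDE_LENGTH:]


def locate_cleavage(mature_seq: str, fragment_start: str):
    """Return (P1 position, P1 residue, P1' position, P1' residue), 1-based."""
    idx = mature_seq.find(fragment_start)
    if idx < 1:
        raise ValueError("fragment not found downstream of the N-terminus")
    return idx, mature_seq[idx - 1], idx + 1, mature_seq[idx]


site = locate_cleavage(mature, KGF_B_N_TERMINUS)
print("mature N-terminus:", mature[:16])
print("cleavage between %s%d and %s%d" % (site[1], site[0], site[3], site[2]))
```

The mature N-terminus produced this way (C-N-D-M-T-P-E-Q-M-A-T-N-V-N-C-S) also lines up with the KGF-a Edman sequence, with the unassigned positions X1, X2 and X3 falling on Cys1, Asn14 and Cys15.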

To further characterize purified KGF-a and KGF-b, the protein was
subjected to glycosidases (neuraminidase, O-glycanase, and/or
N-glycanase), using known techniques (Sasaki et al. (1987),
J. Biol. Chem., 262:12059-12076; Takeuchi et al. (1988), J. Biol.
Chem., 263:3657-3663; Zsebo et al. (1990), Cell, 63:195-201). These
data indicate that KGF-a contains N- and O-linked carbohydrates,
although the lower molecular weight form of KGF-a probably contains
only N-linked sugar. Glycosidase treatment did not cause molecular
weight reduction for KGF-b, indicating that the molecule is
unglycosylated.

Example 4: Biological Activity

Each KGF analog was diluted and assayed for biological activity by
measuring the [3H]-thymidine uptake of Balb/MK cells (according to
the method of Rubin et al. (1989), supra). The samples were first
diluted in a bioassay medium consisting of 50% customer-made
Eagle's MEM, 50% customer-made F12, 5 µg/mL transferrin, 5 ng/mL
sodium selenite, 0.0005% HSA and 0.005% Tween 20. KGF samples were
then added into Falcon Primaria 96-well plates seeded with Balb/MK
cells. Incorporation of [3H]-thymidine during DNA synthesis was
measured and converted to input native KGF concentration by
comparison to a native KGF standard curve. Each of the tested
analogs exhibited mitogenic activity.
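
Converting a measured signal to an equivalent native KGF concentration amounts to reading the sample off the standard curve. The sketch below shows one simple way to do this, by piecewise-linear interpolation on the logarithm of concentration. The standard-curve values are hypothetical placeholders, and the interpolation scheme is an assumption made for illustration; the source does not state how the comparison was performed.

```python
import math
from bisect import bisect_left

# Hypothetical standard curve: (native KGF in ng/mL, [3H]-thymidine cpm).
# These numbers are placeholders, not data from the assay described above.
STANDARD = [(0.1, 1200.0), (1.0, 4800.0), (10.0, 15600.0), (100.0, 30000.0)]


def equivalent_kgf_ng_per_ml(cpm: float, standard=STANDARD) -> float:
    """Interpolate a sample signal onto the standard curve (log-linear)."""
    signals = [s for _, s in standard]
    if not signals[0] <= cpm <= signals[-1]:
        raise ValueError("signal outside the standard curve range")
    i = bisect_left(signals, cpm)
    if signals[i] == cpm:
        return standard[i][0]
    (c_lo, s_lo), (c_hi, s_hi) = standard[i - 1], standard[i]
    frac = (cpm - s_lo) / (s_hi - s_lo)
    log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
    return 10 ** log_c


print(f"{equivalent_kgf_ng_per_ml(10200.0):.2f} ng/mL native KGF equivalent")
```

Signals outside the range spanned by the standards are rejected rather than extrapolated, since the curve is only defined between its lowest and highest points.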

Interaction with the KGF receptor was examined using isolated KGF
receptor membrane preparations prepared from Balb/MK mouse
epidermal keratinocytes (by the procedure described by Massague
(1983), J. Biol. Chem., 258:13614-13620). Specifically, various
forms of KGF were diluted with 50 mM Tris-HCl, pH 7.5, containing
0.2% bovine serum albumin so as to range in concentration from
0.8 ng to 100 ng per 50 µL. They were individually incubated with
the membrane preparation (75 ng/mL) and 125I-labeled E. coli-derived
KGF (1.5 ng). Receptor binding and competition experiments were
performed at 4°C for 16 hr., after which time samples were taken,
centrifuged, and washed twice with the above diluent buffer to
remove unbound and non-specifically bound, labeled KGF. Samples
were then counted for the remaining radioactivity. Competition
curves for receptor binding between KGF samples and labeled KGF
were constructed by plotting percent uncompetition versus
concentrations of each KGF sample. Radioreceptor assay
uncompetition experiments indicated that E. coli-derived KGF,
KGF-a, and KGF-b have similar receptor binding activity.

While the present invention has been described above both generally
and in terms of preferred embodiments, it is understood that other
variations and modifications will occur to those skilled in the art
in light of the description above.

SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: Amgen Inc. 



(ii) TITLE OF INVENTION: Method for Purifying Keratinocyte 

Growth Factors 

(iii) NUMBER OF SEQUENCES: 52 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Amgen Inc. 

(B) STREET: 1840 DeHavilland Drive 

(C) CITY: Thousand Oaks 

(D) STATE: California 

(E) COUNTRY: U.S.A. 

(F) ZIP: 91320-1789 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/487,830 

(B) FILING DATE: 

(C) CLASSIFICATION: not yet known 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 862 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CAATCTACAA TTCACAGATA GGAAGAGGTC AATGACCTAG GAGTAACAAT CAACTCAAGA 60

TTCATTTTCA TTATGTTATT CATGAACACC CGGAGCACTA CACTATAATG CACAAATGGA 120

TACTGACATG GATCCTGCCA ACTTTGCTCT ACAGATCATG CTTTCACATT ATCTGTCTAG 180

TGGGTACTAT ATCTTTAGCT TGCAATGACA TGACTCCAGA GCAAATGGCT ACAAATGTGA 240

ACTGTTCCAG CCCTGAGCGA CACACAAGAA GTTATGATTA CATGGAAGGA GGGGATATAA 300

GAGTGAGAAG ACTCTTCTGT CGAACACAGT GGTACCTGAG GATCGATAAA AGAGGCAAAG 360

TAAAAGGGAC CCAAGAGATG AAGAATAATT ACAATATCAT GGAAATCAGG ACAGTGGCAG 420

TTGGAATTGT GGCAATCAAA GGGGTGGAAA GTGAATTCTA TCTTGCAATG AACAAGGAAG 480

GAAAACTCTA TGCAAAGAAA GAATGCAATG AAGATTGTAA CTTCAAAGAA CTAATTCTGG 540

AAAACCATTA CAACACATAT GCATCAGCTA AATGGACACA CAACGGAGGG GAAATGTTTG 600

TTGCCTTAAA TCAAAAGGGG ATTCCTGTAA GAGGAAAAAA AACGAAGAAA GAACAAAAAA 660

CAGCCCACTT TCTTCCTATG GCAATAACTT AATTGCATAT GGTATATAAA GAACCCAGTT 720

CCAGCAGGGA GATTTCTTTA AGTGGACTGT TTTCTTTCTT CTCAAAATTT TCTTTCCTTT 780

TATTTTTTAG TAATCAAGAA AGGCTGGAAA AACTACTGAA AAACTGATCA AGCTGGACTT 840

GTGCATTTAT GTTTGTTTTA AG 862
(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met His Lys Trp Ile Leu Thr Trp Ile Leu Pro Thr Leu Leu Tyr Arg
1               5                   10                  15

Ser Cys Phe His Ile Ile Cys Leu Val Gly Thr Ile Ser Leu Ala Cys
            20                  25                  30

Asn Asp Met Thr Pro Glu Gln Met Ala Thr Asn Val Asn Cys Ser Ser
        35                  40                  45

Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly Asp Ile
    50                  55                  60

Arg Val Arg Arg Leu Phe Cys Arg Thr Gln Trp Tyr Leu Arg Ile Asp
65                  70                  75                  80

Lys Arg Gly Lys Val Lys Gly Thr Gln Glu Met Lys Asn Asn Tyr Asn
                85                  90                  95

Ile Met Glu Ile Arg Thr Val Ala Val Gly Ile Val Ala Ile Lys Gly
            100                 105                 110

Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys Leu Tyr
        115                 120                 125

Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu Ile Leu
    130                 135                 140

Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His Asn Gly
145                 150                 155                 160

Gly Glu Met Phe Val Ala Leu Asn Gln Lys Gly Ile Pro Val Arg Gly
                165                 170                 175

Lys Lys Thr Lys Lys Glu Gln Lys Thr Ala His Phe Leu Pro Met Ala
            180                 185                 190

Ile Thr



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 595 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

ATCGATTTGA TTCTAGAAGG AGGAATAACA TATGAAAAAG CGCGCACGTG CTATCGCCAT 60 

TGCTGTGGCT CTGGCAGGTT TCGCAACTAG TGCACACGCG TGCAATGACA TGACTCCAGA 120 

GCAAATGGCT ACAAATGTGA ACTGTTCCAG CCCTGAGCGA CACACAAGAA GTTATGATTA 180 

CATGGAAGGA GGGGATATAA GAGTGAGAAG ACTCTTCTGT CGAACACAGT GGTACCTGAG 240 

GATCGATAAA AGAGGCAAAG TAAAAGGGAC CCAAGAGATG AAGAATAATT ACAATATCAT 300 

GGAAATCAGG ACAGTGGCAG TTGGAATTGT GGCAATCAAA GGGGTGGAAA GTGAATTCTA 360 

TCTTGCAATG AACAAGGAAG GAAAACTCTA TGCAAAGAAA GAATGCAATG AAGATTGTAA 420 

CTTCAAAGAA CTAATTCTGG AAAACCATTA CAACACATAT GCATCAGCTA AATGGACACA 480 

CAACGGAGGG GAAATGTTTG TTGCCTTAAA TCAAAAGGGG ATTCCTGTAA GAGGAAAAAA 540 

AACGAAGAAA GAACAAAAAA CAGCCCACTT TCTTCCTATG GCAATAACTT AATAG 595 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Lys Lys Arg Ala Arg Ala Ile Ala Ile Ala Val Ala Leu Ala Gly
1               5                   10                  15

Phe Ala Thr Ser Ala His Ala Cys Asn Asp Met Thr Pro Glu Gln Met
            20                  25                  30

Ala Thr Asn Val Asn Cys Ser Ser Pro Glu Arg His Thr Arg Ser Tyr
        35                  40                  45

Asp Tyr Met Glu Gly Gly Asp Ile Arg Val Arg Arg Leu Phe Cys Arg
    50                  55                  60

Thr Gln Trp Tyr Leu Arg Ile Asp Lys Arg Gly Lys Val Lys Gly Thr
65                  70                  75                  80

Gln Glu Met Lys Asn Asn Tyr Asn Ile Met Glu Ile Arg Thr Val Ala
                85                  90                  95

Val Gly Ile Val Ala Ile Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala
            100                 105                 110

Met Asn Lys Glu Gly Lys Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp
        115                 120                 125

Cys Asn Phe Lys Glu Leu Ile Leu Glu Asn His Tyr Asn Thr Tyr Ala
    130                 135                 140

Ser Ala Lys Trp Thr His Asn Gly Gly Glu Met Phe Val Ala Leu Asn
145                 150                 155                 160

Gln Lys Gly Ile Pro Val Arg Gly Lys Lys Thr Lys Lys Glu Gln Lys
                165                 170                 175

Thr Ala His Phe Leu Pro Met Ala Ile Thr
            180                 185

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 499 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TATGTGCAAT GACATGACTC CAGAGCAAAT GGCTACAAAT GTGAACTGTT CCAGCCCTGA 60 

GCGACACACA AGAAGTTATG ATTACATGGA AGGAGGGGAT ATAAGAGTGA GAAGACTCTT 120 

CTGTCGAACA CAGTGGTACC TGAGGATCGA TAAAAGAGGC AAAGTAAAAG GGACCCAAGA 180 



WO 96/11952    PCT/US95/13099



GATGAAGAAT AATTACAATA TCATGGAAAT CAGGACAGTG GCAGTTGGAA TTGTGGCAAT 240 

CAAAGGGGTG GAAAGTGAAT TCTATCTTGC AATGAACAAG GAAGGAAAAC TCTATGCAAA 300 

GAAAGAATGC AATGAAGATT GTAACTTCAA AGAACTAATT CTGGAAAACC ATTACAACAC 360 

ATATGCATCA GCTAAATGGA CACACAACGG AGGGGAAATG TTTGTTGCCT TAAATCAAAA 420 

GGGGATTCCT GTAAGAGGAA AAAAAACGAA GAAAGAACAA AAAACAGCCC ACTTTCTTCC 480 

TATGGCAATA ACTTAATAG 499 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Cys Asn Asp Met Thr Pro Glu Gln Met Ala Thr Asn Val Asn Cys
1               5                   10                  15

Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly
            20                  25                  30

Asp Ile Arg Val Arg Arg Leu Phe Cys Arg Thr Gln Trp Tyr Leu Arg
        35                  40                  45

Ile Asp Lys Arg Gly Lys Val Lys Gly Thr Gln Glu Met Lys Asn Asn
    50                  55                  60

Tyr Asn Ile Met Glu Ile Arg Thr Val Ala Val Gly Ile Val Ala Ile
65                  70                  75                  80

Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys
                85                  90                  95

Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu
            100                 105                 110

Ile Leu Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His
        115                 120                 125

Asn Gly Gly Glu Met Phe Val Ala Leu Asn Gln Lys Gly Ile Pro Val
    130                 135                 140

Arg Gly Lys Lys Thr Lys Lys Glu Gln Lys Thr Ala His Phe Leu Pro
145                 150                 155                 160

Met Ala Ile Thr

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
CAATGACCTA GGAGTAACAA TCAAC 
(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 
AAAACAAACA TAAATGCACA AGTCCA 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 
ACAACGCGTG CAATGACATG ACTCCA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
ACAGGATCCT ATTAAGTTAT TGCCATAGGA A 31
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ACACATATGT GCAATGACAT GACTCCA 27 
(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CTGCGTATCG ACAAACGCGG CAAAGTCAAG GGCACCC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AAGAGATGAA AAACAACTAC AATATTATGG AAATCCGTAC TGTT 44
(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GCTGTTCGTA TCGTTGCAAT CAAAGGTGTT GAATCTG
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TCTTGGGTGC CCTTGACTTT GCCGCGTTTG TCGATACGCA GGTAC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
ACAGCAACAG TACGGATTTC CATAATATTG TAGTTGTTTT TCATC 
(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
AATTCAGATT CAACACCTTT GATTGCAACG ATACCA 36

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 
AGTTTTGATC TAGAAGGAGG 20
(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TCAAAACTGG ATCCTATTAA 20 
(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
AGTTTTGATC TAGAAGGAGG AATAACATAT GTGCAACGAC ATGACTCCGG AACAGATGGC 60
TACCAACGTT AACTGCTCCA GCCCGGAACG T 91
(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
CACACCCGTA GCTACGACTA CATGGAAGGT GGTGACATCC GTGTTCGTCG TCTGTTCTGC 60 
CGTACCCAGT GGTACCTGCG TATCGACAAA 90 
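
Each entry in this listing states a length under "(A) LENGTH" and carries running position counters at the end of each sequence line. As a minimal sketch (the function name `listed_sequence` is hypothetical and not part of the source), the declared length of an entry such as SEQ ID NO: 21 can be checked against the printed bases like this:

```python
def listed_sequence(lines):
    """Join sequence-listing lines into one string, dropping position counters."""
    bases = []
    for line in lines:
        for token in line.split():
            # Tokens made only of digits are running position counters
            # (for example 60 or 90), not part of the sequence itself.
            if token.isdigit():
                continue
            bases.append(token.upper())
    return "".join(bases)


# SEQ ID NO: 21 exactly as printed in the listing above.
seq21 = listed_sequence([
    "CACACCCGTA GCTACGACTA CATGGAAGGT GGTGACATCC GTGTTCGTCG TCTGTTCTGC 60",
    "CGTACCCAGT GGTACCTGCG TATCGACAAA 90",
])

declared_length = 90  # value given under "(A) LENGTH" for SEQ ID NO: 21
print(len(seq21) == declared_length)
```

The same helper can be applied to any nucleic acid entry in the listing, which is one way to spot bases lost or split during transcription.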
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CGTGGTAAAG TTAAAGGTAC CCAGGAAATG AAAAACAACT ACAACATCAT GGAAATCCGT 60 
ACTGTTGCTG TTGGTATCGT TGCAATCAAA 90
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:
GGTGTTGAAT CTGAATTCTA CCTGGCAATG AACAAAGAAG GTAAACTGTA CGCAAAAAAA 60
GAATGCAACG AAGACTGCAA CTTCAAAGAA 90
(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CTGATCCTGG AAAACCACTA CAACACCTAC GCATCTGCTA AATGGACCCA CAACGGTGGT 60
GAAATGTTCG TTGCTCTGAA CCAGAAAGGT 90 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
ATCCCGGTTC GTGGTAAAAA AACCAAAAAA GAACAGAAAA CCGCTCACTT CCTGCCGATG 60 
GCAATCACTT AATAGGATCC AGTTTTGA 88 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:
TACGGGTGTG ACGTTCCGGG 20
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
CTTTACCACG TTTGTCGATA 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28 
ATTCAACACC TTTGATTGCA 
(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
CCAGGATCAG TTCTTTGAAG 
(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30
GAACCGGGAT ACCTTTCTGG 
(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

ATGTCTAATG ATATGACTCC GGAACAGATG GCTACCAACG TTAACTCCTC CTCCCCGGAA 60

CGTCACACGC GTTCCTACGA CTACATGGAA GGTGGTGACA TCCGCGTACG TCGTCTGTTC 120

TGCCGTACCC AGTGGTACCT GCGTATCGAC AAACGCGGCA AAGTCAAGGG CACCCAAGAG 180

ATGAAAAACA ACTACAATAT TATGGAAATC CGTACTGTTG CTGTTGGTAT CGTTGCAATC 240

AAAGGTGTTG AATCTGAATT CTACCTGGCA ATGAACAAAG AAGGTAAACT GTACGCAAAA 300

AAAGAATGCA ACGAAGACTG CAACTTCAAA GAACTGATCC TGGAAAACCA CTACAACACC 360

TACGCATCTG CTAAATGGAC CCACAACGGT GGTGAAATGT TCGTTGCTCT GAACCAGAAA 420

GGTATCCCGG TTCGTGGTAA AAAAACCAAA AAAGAACAGA AAACCGCTCA CTTCCTGCCG 480

ATGGCAATCA CTTAA 495

(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Ser Asn Asp Met Thr Pro Glu Gln Met Ala Thr Asn Val Asn Ser
1 5 10 15

Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly
20 25 30

Asp Ile Arg Val Arg Arg Leu Phe Cys Arg Thr Gln Trp Tyr Leu Arg
35 40 45

Ile Asp Lys Arg Gly Lys Val Lys Gly Thr Gln Glu Met Lys Asn Asn
50 55 60

Tyr Asn Ile Met Glu Ile Arg Thr Val Ala Val Gly Ile Val Ala Ile
65 70 75 80

Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys
85 90 95

Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu
100 105 110

Ile Leu Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His
115 120 125

Asn Gly Gly Glu Met Phe Val Ala Leu Asn Gln Lys Gly Ile Pro Val
130 135 140

Arg Gly Lys Lys Thr Lys Lys Glu Gln Lys Thr Ala His Phe Leu Pro
145 150 155 160

Met Ala Ile Thr



(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

ATGTGCAATG ATATGACTCC TGAACAAATG GCTACCAATG TCAACTGTTC CTCTCCGGAG 60 

CGCCACACCC GGAGTTACGA TTACATGGAA GGTGGGGATA TTCGCGTACG TCGTCTGTTC 120 

TGCCGTACCC AGTGGTACCT GCGTATCGAC AAACGCGGCA AAGTCAAGGG CACCCAAGAG 180 

ATGAAAAACA ACTACAATAT TATGGAAATC CGTACTGTTG CTGTTGGTAT CGTTGCAATC 240 

AAAGGTGTTG AATCTGAATT CTATCTTGCA ATGAACAAGG AAGGAAAACT CTATGCAAAG 300 

AAAGAATGCA ATGAAGATTG TAACTTCAAA GAACTAATTC TGGAAAACCA TTACAACACA 360 

TATGCATCTG CTAAATGGAC CCACAACGGT GGTGAAATGT TCGTTGCTCT GAACCAGAAA 420 

GGTATCCCTG TTCAAGGTAA GAAAACCAAG AAAGAACAGA AAACCGCTCA CTTCCTGCCG 480 

ATGGCAATCA CTTAA 495
(2) INFORMATION FOR SEQ ID NO:34:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Cys Asn Asp Met Thr Pro Glu Gln Met Ala Thr Asn Val Asn Cys
1 5 10 15

Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly
20 25 30

Asp Ile Arg Val Arg Arg Leu Phe Cys Arg Thr Gln Trp Tyr Leu Arg
35 40 45

Ile Asp Lys Arg Gly Lys Val Lys Gly Thr Gln Glu Met Lys Asn Asn
50 55 60

Tyr Asn Ile Met Glu Ile Arg Thr Val Ala Val Gly Ile Val Ala Ile
65 70 75 80

Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys
85 90 95

Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu
100 105 110

Ile Leu Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His
115 120 125

Asn Gly Gly Glu Met Phe Val Ala Leu Asn Gln Lys Gly Ile Pro Val
130 135 140

Gln Gly Lys Lys Thr Lys Lys Glu Gln Lys Thr Ala His Phe Leu Pro
145 150 155 160

Met Ala Ile Thr
(2) INFORMATION FOR SEQ ID NO:35:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 495 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

ATGTCTAATG ATATGACTCC GGAACAGATG GCTACCAACG TTAACTCCTC CTCCCCGGAA 60 

CGTCACACGC GTTCCTACGA CTACATGGAA GGTGGTGACA TCCGCGTACG TCGTCTGTTC 120 

TGCCGTACCC AGTGGTACCT GCGTATCGAC AAACGCGGCA AAGTCAAGGG CACCCAAGAG 180 

ATGAAAAACA ACTACAATAT TATGGAAATC CGTACTGTTG CTGTTGGTAT CGTTGCAATC 240 

AAAGGTGTTG AATCTGAATT CTATCTTGCA ATGAACAAGG AAGGAAAACT CTATGCAAAG 300 

AAAGAATGCA ATGAAGATTG TAACTTCAAA GAACTAATTC TGGAAAACCA TTACAACACA 360 

TATGCATCTG CTAAATGGAC CCACAACGGT GGTGAAATGT TCGTTGCTCT GAACCAGAAA 420 

GGTATCCCTG TTCAAGGTAA GAAAACCAAG AAAGAACAGA AAACCGCTCA CTTCCTGCCG 480 

ATGGCAATCA CTTAA 495 
(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Ser Asn Asp Met Thr Pro Glu Gln Met Ala Thr Asn Val Asn Ser
1 5 10 15

Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly Gly
20 25 30

Asp Ile Arg Val Arg Arg Leu Phe Cys Arg Thr Gln Trp Tyr Leu Arg
35 40 45

Ile Asp Lys Arg Gly Lys Val Lys Gly Thr Gln Glu Met Lys Asn Asn
50 55 60

Tyr Asn Ile Met Glu Ile Arg Thr Val Ala Val Gly Ile Val Ala Ile
65 70 75 80

Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly Lys
85 90 95

Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu Leu
100 105 110

Ile Leu Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr His
115 120 125

Asn Gly Gly Glu Met Phe Val Ala Leu Asn Gln Lys Gly Ile Pro Val
130 135 140

Gln Gly Lys Lys Thr Lys Lys Glu Gln Lys Thr Ala His Phe Leu Pro
145 150 155 160

Met Ala Ile Thr

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

ATGTCTTCTC CTGAACGTCA TACGCGTTCC TACGACTACA TGGAAGGTGG TGACATCCGC 60

GTACGTCGTC TGTTCTGCCG TACCCAGTGG TACCTGCGTA TCGACAAACG CGGCAAAGTC 120 

AAGGGCACCC AAGAGATGAA AAACAACTAC AATATTATGG AAATCCGTAC TGTTGCTGTT 180 

GGTATCGTTG CAATCAAAGG TGTTGAATCT GAATTCTACC TGGCAATGAA CAAAGAAGGT 240

AAACTGTACG CAAAAAAAGA ATGCAACGAA GACTGCAACT TCAAAGAACT GATCCTGGAA 300 

AACCACTACA ACACCTACGC ATCTGCTAAA TGGACCCACA ACGGTGGTGA AATGTTCGTT 360 

GCTCTGAACC AGAAAGGTAT CCCGGTTCGT GGTAAAAAAA CCAAAAAAGA ACAGAAAACC 420 

GCTCACTTCC TGCCGATGGC AATCACTTAA 450 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Met Ser Ser Pro Glu Arg His Thr Arg Ser Tyr Asp Tyr Met Glu Gly
1 5 10 15

Gly Asp Ile Arg Val Arg Arg Leu Phe Cys Arg Thr Gln Trp Tyr Leu
20 25 30

Arg Ile Asp Lys Arg Gly Lys Val Lys Gly Thr Gln Glu Met Lys Asn
35 40 45

Asn Tyr Asn Ile Met Glu Ile Arg Thr Val Ala Val Gly Ile Val Ala
50 55 60

Ile Lys Gly Val Glu Ser Glu Phe Tyr Leu Ala Met Asn Lys Glu Gly
65 70 75 80

Lys Leu Tyr Ala Lys Lys Glu Cys Asn Glu Asp Cys Asn Phe Lys Glu
85 90 95

Leu Ile Leu Glu Asn His Tyr Asn Thr Tyr Ala Ser Ala Lys Trp Thr
100 105 110

His Asn Gly Gly Glu Met Phe Val Ala Leu Asn Gln Lys Gly Ile Pro
115 120 125

Val Arg Gly Lys Lys Thr Lys Lys Glu Gln Lys Thr Ala His Phe Leu
130 135 140

Pro Met Ala Ile Thr
145

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

ATGTCCTACG ACTACATGGA AGGTGGTGAC ATCCGCGTAC GTCGTCTGTT CTGCCGTACC 60 

CAGTGGTACC TGCGTATCGA CAAACGCGGC AAAGTCAAGG GCACCCAAGA GATGAAAAAC 120 

AACTACAATA TTATGGAAAT CCGTACTGTT GCTGTTGGTA TCGTTGCAAT CAAAGGTGTT 180 

GAATCTGAAT TCTACCTGGC AATGAACAAA GAAGGTAAAC TGTACGCAAA AAAAGAATGC 240 

AACGAAGACT GCAACTTCAA AGAACTGATC CTGGAAAACC ACTACAACAC CTACGCATCT 300 

GCTAAATGGA CCCACAACGG TGGTGAAATG TTCGTTGCTC TGAACCAGAA AGGTATCCCG 360 

GTTCGTGGTA AAAAAACCAA AAAAGAACAG AAAACCGCTC ACTTCCTGCC GATGGCAATC 420 
ACTTAA 426
(2) INFORMATION FOR SEQ ID NO:40:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Met Ser Tyr Asp Tyr Met Glu Gly Gly Asp Ile Arg Val Arg Arg Leu
1 5 10 15

Phe Cys Arg Thr Gln Trp Tyr Leu Arg Ile Asp Lys Arg Gly Lys Val
20 25 30

Lys Gly Thr Gln Glu Met Lys Asn Asn Tyr Asn Ile Met Glu Ile Arg
35 40 45

Thr Val Ala Val Gly Ile Val Ala Ile Lys Gly Val Glu Ser Glu Phe
50 55 60

Tyr Leu Ala Met Asn Lys Glu Gly Lys Leu Tyr Ala Lys Lys Glu Cys
65 70 75 80

Asn Glu Asp Cys Asn Phe Lys Glu Leu Ile Leu Glu Asn His Tyr Asn
85 90 95

Thr Tyr Ala Ser Ala Lys Trp Thr His Asn Gly Gly Glu Met Phe Val
100 105 110

Ala Leu Asn Gln Lys Gly Ile Pro Val Arg Gly Lys Lys Thr Lys Lys
115 120 125

Glu Gln Lys Thr Ala His Phe Leu Pro Met Ala Ile Thr
130 135 140
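
The protein entries in this listing pair with the cDNA entries that precede them (for example SEQ ID NO: 39 with SEQ ID NO: 40). The following is a minimal sketch, assuming the standard genetic code, of how such a pair can be cross-checked; the names `translate` and `three_letter` are hypothetical helpers introduced here for illustration and do not appear in the source.

```python
BASES = "TCAG"
# Standard genetic code, ordered by first, second, third base over T, C, A, G.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def three_letter(protein):
    """Render a one-letter protein string in the three-letter style of the listing."""
    return " ".join(THREE_LETTER[residue] for residue in protein)


# First 60 bases of SEQ ID NO: 39 as printed in the listing.
first_line = "ATGTCCTACG ACTACATGGA AGGTGGTGAC ATCCGCGTAC GTCGTCTGTT CTGCCGTACC"
print(three_letter(translate(first_line)))
```

Run on the first line of SEQ ID NO: 39, the output can be compared by eye with the first twenty residues of SEQ ID NO: 40.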



(2) INFORMATION FOR SEQ ID NO:41: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown



(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:

ATGTCCTACG ACTACATGGA AGGTGGTGAC ATCCGCGTAC GTCGTCTGTT CTGCCGTACC 60 

CAGTGGTACC TGCGTATCGA CAAACGCGGC AAAGTCAAGG GCACCCAAGA GATGAAAAAC 120 

AACTACAATA TTATGGAAAT CCGTACTGTT GCTGTTGGTA TCGTTGCAAT CAAAGGTGTT 180

GAATCTGAAT TCTATCTTGC AATGAACAAG GAAGGAAAAC TCTATGCAAA GAAAGAATGC 240 

AATGAAGATT GTAACTTCAA AGAACTAATT CTGGAAAACC ATTACAACAC ATATGCATCT 300 

GCTAAATGGA CCCACAACGG TGGTGAAATG TTCGTTGCTC TGAACCAGAA AGGTATCCCT 360 

GTTCAAGGTA AGAAAACCAA GAAAGAACAG AAAACCGCTC ACTTCCTGCC GATGGCAATC 420 

ACTTAA 426 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

Met Ser Tyr Asp Tyr Met Glu Gly Gly Asp Ile Arg Val Arg Arg Leu
1 5 10 15

Phe Cys Arg Thr Gln Trp Tyr Leu Arg Ile Asp Lys Arg Gly Lys Val
20 25 30

Lys Gly Thr Gln Glu Met Lys Asn Asn Tyr Asn Ile Met Glu Ile Arg
35 40 45

Thr Val Ala Val Gly Ile Val Ala Ile Lys Gly Val Glu Ser Glu Phe
50 55 60

Tyr Leu Ala Met Asn Lys Glu Gly Lys Leu Tyr Ala Lys Lys Glu Cys
65 70 75 80

Asn Glu Asp Cys Asn Phe Lys Glu Leu Ile Leu Glu Asn His Tyr Asn
85 90 95

Thr Tyr Ala Ser Ala Lys Trp Thr His Asn Gly Gly Glu Met Phe Val
100 105 110

Ala Leu Asn Gln Lys Gly Ile Pro Val Gln Gly Lys Lys Thr Lys Lys
115 120 125

Glu Gln Lys Thr Ala His Phe Leu Pro Met Ala Ile Thr
130 135 140
(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 
GAGCTCACTA GTGTCGACCT GCAG 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: -

(B) LOCATION: complement (1..24) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 
CTGCAGGTCG ACACTAGTGA GCTC 
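
SEQ ID NO: 44 carries a FEATURE location of complement (1..24), which suggests it is the reverse complement of SEQ ID NO: 43. A minimal sketch of that check follows; `reverse_complement` is a hypothetical helper name, and the reading of the FEATURE line is an assumption rather than something the listing states outright.

```python
# Map each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string, ignoring spacing."""
    cleaned = "".join(dna.split()).upper()
    return cleaned.translate(COMPLEMENT)[::-1]


seq43 = "GAGCTCACTA GTGTCGACCT GCAG"  # SEQ ID NO: 43 as printed
seq44 = "CTGCAGGTCG ACACTAGTGA GCTC"  # SEQ ID NO: 44 as printed

print(reverse_complement(seq43) == "".join(seq44.split()))
```

The two 24-base sequences are consistent with being the two strands of one double-stranded linker.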
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 
CAATCTACAA TTCACAGA 
(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 
TTAAGTTATT GCCATAGG 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 
AACAAAGCTT CTACAATTCA CAGATAGGA 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48 
AACAAGATCT TAAGTTATTG CCATAGG 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49
CGGTCTAGAC CACCATGCAC AAATGGATAC TGACATGG 
(2) INFORMATION FOR SEQ ID NO: 50:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50:
GCCGTCGACC TATTAAGTTA TTGCCATAGG AAG 
(2) INFORMATION FOR SEQ ID NO:51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Xaa Asn Asp Met Thr Pro Glu Gln Met Ala Thr Asn Val Xaa Xaa Ser
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Ser Tyr Asp Tyr Met Glu Gly Gly Asp Ile Arg Val
1 5 10
WHAT IS CLAIMED IS:

1. A method for purifying a keratinocyte growth
factor (KGF), the method comprising:

a) obtaining a solution comprising KGF;

b) binding KGF from the solution of part (a) to a
cation exchange resin;

c) eluting KGF in an eluate solution from the
cation exchange resin;

d) passing the eluate solution from part (c)
through an appropriate molecular weight
exclusion matrix; and

e) recovering KGF from the molecular weight
exclusion matrix.

2. The method according to Claim 1 wherein the 
KGF is produced in procaryotic cells. 

3. The method according to Claim 1 wherein the
KGF is produced in E. coli.

4. The method according to Claim 1 wherein the KGF
is produced in mammalian cells.

5. The method according to Claim 4 wherein the
KGF is produced in Chinese hamster ovary cells.

6. A method for purifying a keratinocyte growth
factor (KGF), the method comprising:

a) obtaining a solution comprising KGF;

b) binding KGF from the solution of part (a) to a
cation exchange resin;

c) eluting KGF in an eluate solution from the
cation exchange resin;

d) performing hydrophobic interaction
chromatography on the eluate solution of part
(c); and

e) recovering KGF from the hydrophobic
interaction chromatography step of part (d).

7. A method according to Claim 6 further
comprising oxidation of free sulfhydryl groups in KGF.

8. The method according to Claim 6 wherein the
KGF is produced in procaryotic cells.

9. The method according to Claim 7 wherein the
KGF is produced in E. coli.

10. The method according to Claim 6 wherein the
KGF is produced in mammalian cells.

11. The method according to Claim 10 wherein the
KGF is produced in Chinese hamster ovary cells.
Figure 1

human KGF (+ signal sequence)

[Sequence figure: nucleotide sequence (positions 1 to 862) shown with the encoded amino acid sequence; the annealing positions of oligonucleotide primers, including OLIGO#25, OLIGO#26 and OLIGO#2, are marked.]
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6/15 
fiquxm 3 

RSH-KG? 

plaaid DMA CUX JEbaX NdmZ 

3«quanc« 5 ' -AXCSATrTGATTCTAGAAGGAGGAATAACATATGAAAAAG- 

M K K 

R3H signal 3«qu«nc« MluX 
-CGCGCXCCTGCTATCGCCATTGCTGTGGCTCTGGC^ . 

RARAIAtAVALAGrATSAHA- 
KluZ 

5 9 CGCGTGCAATGACATGACTCCAGAGCAAATGGCTACAAATGTGAACXG 

1 1 1 > ' — ♦ 60 

-CNDMTPEQMATNVNCS SPE 

H3CGACACXdXW3AAGTTATGArTACATGGAAGGAGG(^TAT 

-+ 1 1 » h— _+ 120 

RKTRSTOYMEGGOXRVRRLF 

Kpnl ClMl 

-CITnCGAACACAGTGGTACCTGAGGAXCGATAAAAGAGGCAAAGTAA 

" ■ ' M> " ■ ■> 11 1 ■ > — ► iao 

CRTQHXLRIDKRGlCVKGTQ.g 

HjATGAAGAATAATTACAATATCATGGAAATCAGGACAGTGGCAGTT^ 

■■-» » 1 » ■ - > I 240 

MRMHTNZMEXRTV.AVGXVAZ 

EcoJtT 

^CAAAGGGGTGGAAAGTGAATTCTAXCTTGCAAXGAACAAGGAAGGA 
»■ f -» » ■ .. ^ ► 3Q0 

KGVESErTZ.AMKKEGKI.YAK 

BamI 

HUAAGMTGCAA3CUAGAXTGTAACTTCXAJUUAC?AAT7C7GG^ 

• » " »■■ - ■ ► ■ I 360 

KECHE DCMrKEtlLEMHTHT 

McteX 

- ATAXGCAXCXGCTAAATGG A C X CACA> CG GAGGGGAA AI ' Vi 1 1 IU1 XUXXTA AAXCAAAA- 

■ » ■ » ■ ■■ ■ » » ■ ■■■ ■ » " ■ » 420 

TASAKWTHMGGEMrVALNQK 

"GGGGATTCCTGTAAfi3USGAAXAAAAACGAAGAAAGAACAAA^ 

■ ■»■■■■ ■> ■ ■■» ■■ ,--» 480 

G IPVRGKKTKKEQKTAHFLP 

BaaffX 

-TAXGGCAATAACTTAA2AG 3' -piaamid OKA 

■ ■ ■■».■■ 503 -3«quuc« 

M A X T • * 
Figure 4

KGF

[Sequence figure: KGF nucleotide sequence shown with the encoded amino acid sequence and annotated restriction sites (including EcoRI).]
Figure 5

Substitution of KpnI to EcoRI sequence to make KGF(dsd)

[Sequence figure: double-stranded sequence between the KpnI and EcoRI sites, shown with the encoded amino acid sequence and the positions of OLIGO#6 through OLIGO#11.]
PCT/US95/13099

Figure 6

KGF (codon optimized)

[Sequence figure: codon-optimized KGF nucleotide sequence shown with the positions of the overlapping oligonucleotides OLIGO#12 through OLIGO#24.]



WO 96/11952 



Figure 7

KGF C(1,15)S

[Sequence figure: nucleotide sequence (positions 1 to 495) shown with the encoded amino acid sequence.]
Figure 8

KGF R(144)Q

[Sequence figure: nucleotide sequence (positions 1 to 495) shown with the encoded amino acid sequence.]
Figure 9

KGF C(1,15)S/R(144)Q

[Sequence figure: nucleotide sequence (positions 1 to 495) shown with the encoded amino acid sequence.]
Figure 10

KGF ΔN15

[Sequence figure: nucleotide sequence shown with the encoded amino acid sequence.]



Figure 11

KGF ΔN23

[Sequence figure: nucleotide sequence (positions 1 to 426) shown with the encoded amino acid sequence.]
Figure 12

KGF ΔN23/R(144)Q

[Sequence figure: nucleotide sequence shown with the encoded amino acid sequence.]